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Applicant: Kent, et al. 

Serial No.: 09/710,633 

Filed: November 8, 2000 

Title: SYNTHESIS OF PROTEINS BY 
NATIVE CHEMICAL LIGATION 



Mail Stop Appeal Brief - Patents 
Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



Group Art Unit: 1654 
Examiner: Russel, J. 

OurRef.:TSRI478.0Con1 

Reply to Examiner's Answer 



Sir: 

This communication is a Reply to Examiner's Answer, mail dated 03/16/07. 
Attached hereto is an Evidence Appendix containing the following references: 

1. Dawson, P. E. et al., Science (1994), vol. 266, 776,779; 

2. Ueda, H., et al., J. Biol. Chem. (1997), vol. 272, 24966-24960; 

3. Wilken, J. et al., Curr. Opin. Biotech. (1998), vol. 9, 412-426; and 

4. Dawson et al., Annu. Rev. Biochem. (2000), vol. 69, 923- 
960. 



Also attached hereto is an Appendix of Related Proceedings providing the 
following cited cases: 

- Page 1 - 



Reply to Answer 
SN 09/710,633 



1. Fromson v. Advance Offset Plate, Inc., 219 U.S.P.Q. 1137, 
1140-1141 (Fed. Cir. 1983) 

2. in re Rasmussen, 211 U.S.P.Q. 323, 326-27. (CCPA 1981) 

Issues : 

The Examiner has combined Issues No. 1-2 and 5 and maintained these issues 
as a single issue. 

The Examiner has maintained Issue No. 3 as a separate issue. 

The Examiner has withdrawn Issue No. 4. 

Reply to Examiner's Answer regarding Issues 1-2 and 5 : 

Extrinsic Evidence of the Understanding of Persons Skilled in the Art: 

The Examiner alleges that the description of the present application discloses 
only a single example to support claims 11-14 and 32 that this example is insufficient to 
reasonably convey to one skilled in the art that the inventors had possession of their 
invention. 

The present application is based upon work that was subsequently published by 
the inventors in the scientific literature, viz., "Synthesis of Proteins by Native Chemical 
Ligation," Dawson, P. E. et al., Science (1994), vol 266, pages 776,779. Like the 
specification of the present invention, the Dawson reference discloses only a single 
example of the use of the claimed ligation method for synthesizing a protein having a 
variant amino acid, viz., [ala 33 ]IL-8 (see: page 78, Fig. 3, legend for part "A" and 
footnote 28). The only explanation in the Dawson reference for its choice of a mutant 
peptide is "convenience," see footnote 28. 
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The Dawson reference has been cited repeatedly by references that disclose the 
use of the process of claims 11-14 and 32 for making mutant proteins. These 
references provide extrinsic evidence that a disclosure of a scope similar to the 
disclosure of the present application is sufficient to teach persons skilled in the art that 
the ligation process of claims 11-14 and 32 can be employed for synthesizing a desired 
protein or domain thereof wherein the desired protein or domain thereof is "a derivative 
of a naturally isolatable protein that contains one or more variant residues that are not 
found in said naturally isolatable protein." 

For example, Ueda, H., et al. (J. Biol. Chem. 1997, 272, 24966-24960) discloses 
a mutant cytokine (SDF-1a analogue) and cites the Dawson reference (page 24066, 
second column, second paragraph, footnote (15)) as disclosing the chemical ligation 
method employed for synthesizing this mutant cytokine. Subsequently, Wilken, J. et al., 
(Curr. Opin. Biotech. 1998, 9, 412-426) published a detailed review of the use of this 
technology, including its use for making a large number of different classes proteins, 
including mutant proteins, see Table 2 page 416 and Figure 4, page 417. An even 
more extensive review was later provided by Dawson et al., Anna. Rev. Biochem. 2000, 
69, 923-960, see, in particular, pages 333-335 and Table 2 on page 941. 

Taken together, the above references provide extrinsic evidence that a 
disclosure of similar scope to the description of the present application is sufficient to 
impart to persons skilled in the art that the inventors had possession of the invention of 
claims 11-14 and 32. 

Judicial Support: 

The courts have held that a single example can be sufficient to support a genus. 
In re Rasmussen, 650 F.2d at 1214, 211 U.S.P.Q. at 326-27, held that a disclosure of a 
single example of a method of "adhereingly applying" one layer to another was 
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sufficient to support a generic claim to "adhereingly applying" because one skilled in the 
art reading the specification would understand that it is unimportant how the layers are 
adhered, so long as they are adhered. 

Claim 1 1 of the present application is directed to a method for ligating two 
peptides to form a derivative of a naturally isolatable protein having one or more variant 
residues not found in said naturally isolatable protein. Claim 32 of the present 
application is more narrow than Claim 11 and is directed to a method for ligating two 
peptides to form a desired protein or domain thereof that is a derivative of a naturally 
isolatable protein having one or more cysteine residues not found in said naturally 
isolatable protein. Although claims 11 and 32 differ slightly in scope, the application of 
Rasmussen to each of claims 1 1 and 32 is the same. However, the Rasmussen 
analysis of both claims must be divided into two instances, viz., Instance #1, wherein 
the variant residues or cysteines are not at the point of ligation; and Instance #2, 
wherein the variant residues or cysteines are at the point of ligation. 

With respect to Instance #1 , if the variant residues or cysteines are not at the 
point of ligation, one skilled in the art reading the specification would understand that it 
is unimportant with respect to the claimed ligation process whether or not variant 
residues or cysteines were introduced at points other than the point of ligation. A 
person skilled in the are would realize that the ligation process of claims 1 1 and 32 
would proceed equally well whether nor not variant amino acids or cysteines existed at 
points in the resultant protein other than the point of ligation. Accordingly, according to 
the rule of Rasmussen, a specification having only one example is sufficient to support 
the patentability of a generic claim with respect to ligation processes falling under 
Instance #1 . 

In Instance #2, the variant residues or cysteines are at the point of ligation. It 
should be noted that both claims 1 1 and 32 impose a number of limitations on the 
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amino acids at the point of ligation. Both Claims 1 1 and 32 require that the first 
oligopeptide include a C-terminal thioester and that the second oligopeptide include an 
N-terminus having an amino acid residue with an unoxidized sulfhydryl side chain and a 
free amino group capable of forming a P-aminothioester likage with the C-terminal 
thioester that rearrages to form an amide bond therein between. Hence, claims 1 1 and 
32 include both structural and functional limitations on the amino acid residues at the 
point of ligation. One skilled in the art, reading the specification, would understand that 
it is important for the operability of the ligation process to conform with these structural 
and functional limitations with respect to the amino acids at the point of ligation. 
However, given these limitations, one skilled in the art would also realize that it would 
be unimportant with respect to the operability of the claimed ligation process whether or 
not the amino acids at the point of ligation were variant residues not found in any 
naturally isolatable protein. Accordingly, in Instance #2, the introduction of variant 
amino acids must conform with and be consistent with other structural and functional 
limitations within claims 1 1 and 32 relating to the amino acids at the point of ligation. 
But given these limitations, the introduction of variants (in conformance with the 
limitations) would be understood by a skilled person to be unimportant to the operability 
of the claimed ligation process. By application of the rule of Rasmussen, the provision 
of one example in the specification is sufficient to support claims such as claims 1 1 and 
32 with respect to Instance #2. 

Examiner's Mischaracterization of Problem: 

The Examiner states that the example provided by the specification, viz., 
Example 4, pages 37-38 and Scheme 9, page 39, "does not provide support for the 
genus of protein derivatives recited in claims 11-14 and 32." (Examiner's Answer, page 
4, bottom paragraph) The Examiner has mischaracterized claims 11-14 and 32. 
Claims 11-14 are dependent claims that merely limit the products that can be produced, 
i.e., limitations are placed upon the type of "desired protein or domain thereof 
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produced by the claimed process. The process of claims 1 1-14 is unaltered by any of 
these limitations. 

Accordingly, the Examiner mischaracterizes Claims 11-14 when he states that 
they are directed to a "genus of protein derivatives." It would be more accurate to state 
that Claims 11-14 are all directed to the same process, with limitations on the product 
produced by said process. 

Placing limitation on the product does not alter the chemistry of the claimed 
ligation process. The kernel of the invention lies with the ligation process steps and not 
with particular products produced. The products produced by the process of Claims 11- 
14 and 32 are mere variants and are supported by the specification. 

Indeed, the variant products of claims 11-14 and 32 should not be considered 
limitations of the invention. 

"In general, a preamble limits the [claimed] invention if it 
recites essential structure or steps, or if it is 'necessary to 
give life, meaning, and vitality' to the claim." Fromson v. 
Advance Offset Plate, Inc., 219 U.S.P.Q. 1137, 1140-1141 
(Fed. Cir. 1983) 

A person skilled in the art would appreciate that the preamble limitations of 
claims 11-14 and 32 do not "recite essential structure or steps," and are not "'necessary 
to give life, meaning, and vitality' to the claim[s]." The Examiner complains that the 
specification does not elaborate the significance of Example 4. However, a person 
skilled in the art would not require an extensive explanation of Example 4 to appreciate 
that the inventors possessed an understanding that the claimed process could be 
employed to produce variant products without variation of the claimed process. 
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Reply to Examiner's Answer regarding Issue 3 : 
The Examiner states: 

"Clearly, in view of Appellants' disclosure, a cysteine residue 
is required to be present at the N-terminus of one of the 
oligopeptides being ligated in order for the reaction to 
proceed. However, Example 4 and Figure 9 never discuss 
why a cysteine residue is present at position 41." 
(Examiner's Answer, bottom of page 6 and top of page 9) 

Example 4 (page 37) discloses that the peptide corresponding to Sequence No. 
12, which includes the cysteine residue at position 41, undergoes a native chemical 
ligation reaction with the peptide corresponding to Sequence No. 11, to yield the 
product HIV-1 K41, corresponding to Sequence No. 15. The disclosure of Example 4 is 
sufficient to impart to a person of ordinary skill in the art that, in the context of the 
present application, the cysteine residue present at position 41 was a necessary 
functionality for the native chemical ligation reaction to proceed. 

Summary of Reply : 

The Board is requested to reverse the Examiner's final rejection of claims 11-14 
and 32 with respect to all issues. 



Respectfully submitted 




Donald G. Lewis 

Reg. No. 28,636 

The Scripps Research Institute 

10550 N. Torrey Pines Road TPC-8 

San Diego, CA 92037 

May 16, 2007 

(858) 784-2937 
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Court of Appeals, Federal Circuit 
Fromson 

v. Advance Offset Plate, Inc./Graphcqat, 
Inc./News Publishing Company of 
Framingham/Newspapers of 
New England, Inc. 

Nos. 83-850, 83-913, 83-914, 83-915 , 
Decided Nov. 8, 1983 

PATENTS 

1. Infringement — In general (§39,01) 

Infringement — Law or fact question 
(§39.60) 

Infringement issue raises at least questions 
of what is patented and whether what is 
patented haa been made, used, or sold by 
another; first is question of law, and second a 
question of fact. 

2. Court of Appeals for the Federal Cir- 

cuit — Weight given decision re- 
viewed (§26.59) 

District court's decision on infringement 
issue that turns on question of what is patent- 
ed is reviewed as matter of law. 

3* Construction of specification and claims 

— By specification and drawings — 
In general (§22.251) 

Patentee's verbal license augments difficul- 
ty of understanding claims, and to understand 
their meaning, they are construed in connec- 
tion with other parts of patent instrument and 
with circumstances surrounding patent appli- 
cation's inception. 

4. Specification — Theory or principle of 

invention (§62.9) 

Inventor need not comprehend scientific 
principles on which practical effectiveness of 
his invention rests. 

5. Construction of specification and claims 

— Comparison with other claims 
(§22.40) 

Significant evidence of particular claim's 
scope can be found on review of other claims. 

6. Claims — Article defined by process of 

manufacture (§20.15) 

Fact that process limitation appears in 
claim does not convert it to product by process 
claim. 



7. Patentability — Aggregation or combi- 

nation — Of old elements (§51.159) 

There is no basis for treating combinations 
of old elements, differently in determining 
patentability. 

8. Patentability — Invention ; — In general 

(§51.501) 

Analysis under 35 USC 103 for any 
claimed invention requires legal determina- 
tion of whether claimed invention as whole 
would have been obvious to one of ordinary 
skill in art at time it was made. 

9. Construction of specification and claims 

— By Patent Office proceedings 
(§22.151) 

Prosecution history, or file wrapper estop- 
pel doctrine is irrelevant when there is direct 
or contributory literal infringement. 

10. Construction of specification and 
claims — By prior art (§22.20) 

Claims are normally construed as they 
would be by those of ordinary skill in art. 

Particular patents — Printing plates 

3,181,461, Fromson, judgment holding 
claims 1, 4, 6, 7, 12, and 16 vacated. 



Appeal from District Court for the District 
of Massachusetts, Freedman, J.: 219 USPQ 
83. 

Consolidated actions by Howard A. From- 
son, against Advance Offset Plate, Inc.; 
Graphcoat, Inc., News Publishing Company 
of Framingham, and Newspapers of New 
England, Inc., for patent infringement, in 
which defendant counterclaims. From judg- 
ment of noninfringement, plaintiff appeals. 
Vacated and remanded. 

John E. Lynch, New York, N.Y., for 
appellant. 

Arthur F. Dionne, Windsor, Conn., for 
appellee. 

Before Markey, Chief Judge, Davis, Circuit 
Judge, and Nichols, Senior Circuit 
Judge.* 

Markey, Chief Judge. 

Appeal from four judgments of the U.S. 
District Court for the District of Massachu- 



* J u dge Nichols assumed senior status October 
1, 1983. 
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setts holding that claims 1, 4, 6, 7, 12 arid 16 
of U.S. Patent 3,181,461 issued to Fromson 
are not infringed or contributorily infringed. 
We vacate and remand. 

Background 

A. The Technology 

The Fromson patent involves a process for 
making a photographic printing plate for use 
in the art of lithography. The art involves 
creation on a printing surface of certain areas 
that are hydrophilic (water attracting) and 
organophobic (ink repelling) and other areas 
that are organophilic (ink attracting) and hy- 
drophobic (water repelling). 

At the time of the Fromson invention, the 
state of the art was depicted generally by U.S. 
Patent 2,714,006, issued on July 26, 1955 to 
Jewett and Case (Jewett). Jewett teaches the 
preparation of a presensitized lithographic 
plate by: first treating the surface of an alu- 
minum sheet with an aqueous solution of an 
alkali metal silicate to form a water insoluble, 
hydrophilic, siliceous, organophobic surface 
layer; treating that layer with a diazo com- 
pound to form a light-sensitive, water soluble, 
diazo coating; and exposing portions of the 
coatea* plate to light through a negative or 
stencil, thus causing the exposed portions to 
become water insoluble, hydrophobic, organo- 
philic image areas. The plate is then washed 
with water to remove the water soluble diazo 
portions that were not exposed to light, there- 
by exposing the water insoluble, hydrophilic, 
organophobic, siliceous surfaces in their place 
(non-image areas). An image developer or 
printer's developing ink is poured on the plate 
and the^excess wiped off, making the image 
areas plainly visible. The plate is then ready 
for mounting on a press, successive treatments 
with water and ink, and printing. In this 
process, the image areas absorb ink while the 
non-image areas repel it. 

B. The Fromson Patent 

In the 1950's, Fromson was in the business 
of selling metals and began, through Ano- 
Coil Corporation, to manufacture arid sell 
anodized aluminum. In anodization, alumi- 
num is coated with oxide while it is the anode 
in an electrolytic bath wherein it is subjected 
to an electric current, whence the term "anod- 
ized." The anodized aluminum was used in 
articles such as television antennas, furniture 
tubing, and nameplates. 

Fromson, with no background in lithog- 
raphy, conceived of using anodized aluminum 
as a replacement for non-anodized aluminum 



in the plate taught by Jewett His invention 
according to the Fromson patent improves the 
Jewett plate in a number of ways. It enables 
use in preparation of the plate of light-sensi- 
tive compounds other than diazo compounds. 
It enables the coating to absorb nitrogen- 
containing materials released by the light- 
sensitive compounds when exposed to light. 
Also, as the district court found, the Fromson 
plate enjoys improved corrosion resistance 
and a longer press life. 

Fromson filed his application for patent in 
May 1963, and the patent issued in May 
1965, containing eleven product and five pro- 
cess claims. Claim 1 is representative of the 
product claims: 

1. A sensitized photographic printing plate 
comprising an aluminum sheet having a 
surface which has been treated to form an 
aluminum oxide coating on said surface, a 
water-insoluble, hydrophilic, organophobic 
layer on said sheet resulting from the reac- 
tion of the aluminum oxide coating and an 
alkali metal silicate applied to said coating, 
and a light-sensitive coating over said layer 
[e.g., diazo resin] having one solubility in 
relation to a solvent in a state before expo- 
sure to light and another solubility in rela- 
tion to said solvent in another state after 
exposure to light, said light-sensitive mate- 
rial being soluble in said solvent in one of 
said states and being insoluble in said sol- 
vent and in water, hydrophobic and organ- 
ophilic in its other state. 

Claim 12 is the sole independent process 
claim: 

12, The process of making a sensitized 
photographic plate comprising the steps of 
applying to an aluminum sheet having a 
coating of aluminum oxide, a water-solu- 
tion of an alkali metal silicate to cause the 
silicate, to react with the aluminum oxide to 
form a water-insoluble, hydrophilic, organ- 
ophobic layer on said sheet, drying the 
layer, and applying over the dry layer a 
light-sensitive coating having one solubility 
in relation to a solvent in a state before 
exposure to light and another solubility in 
relation to said solvent in another state 
after exposure to light, said light-sensitive 
material being soluble in said solvent in 
one of said states and being insoluble in 
said solvent and in water, hydrophobic and 
organophilic in its other state. 

C. The Advance Plate 

After issuance of his patent, Fromson's 
invention enjoyed extensive commercial suc- 
cess and was the subject of licensing agree- 
ments with several companies. Fromson sued 
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Advance Offset Plate, Inc. ("Advance"), 
charging it with infringement and contribu- 
tory infringement of product claims 1, 4, 6 
ana 7, and process claims 12 and 16. He also 
sued three Advance customers for direct in- 
fringement in actions consolidated with that 
against Advance and now consolidated on 
appeal. We assume for purposes of this ap- 
peal that the claims at issue in those three 
cases are the same as those asserted against 
Advance. 

It is undisputed that Advance manufac- 
tured and sold "wipe-on" plates comprising 
an> anodized aluminum sheet that had been 
treated with an aqueous solution of sodium 
silicate, and that its customers applied a diazo 
coating to those plates. Because the claims 
include the application of a diazo coating or 
other light sensitive layer and because Ad- 
vance's customers, not Advance, applied the 
diazo coating, Advance cannot be Hable for 
direct infringement with respect to those 
plates but could be liable for contributory 
infringement. It is also undisputed that Ad- 
vance, not its customers, applied the diazo 
resin to certain "presensitized" plates. For 
those presensitized plates, Advance could be 
liable for direct infringement. 

Though preparation of the Advance plate 
involves treatment of anodized aluminum 
with an aqueous solution of alkali metal sili- 
cate to yield a water-insoluble, hydrophilic 
layer, as set forth in the claims, Advance and 
its customers deny infringement on the sole 
ground that there is no "'reaction" between 
the aluminum oxide and sodium silicate. It is 
undisputed that Advance and its customers do 
what the claims say, i.e., apply a water solu- 
tion of an alkali metal silicate to an oxide 
coated aluminum sheet to produce a layer, 
but it is argued that the layer does not result 
from a "reaction" as we are asked by Ad- 
vance to define that term. 

During oral argument on appeal, Advance 
suggested; that its process involves different 
conditions (e.g., temperature, time) than the 
Fromson process, an argument neither raised 
in the brief nor addressed by the district 
court. Though the argument cannot therefore 
be considered here, we caution that the assert- 
ed claims contain no temperature or time 
limitations, and that no basis appears on this 
record for limiting the claimed inventions to 
preferred embodiments or specific examples 
m the specification. See, Smith v. Snow, 294 
U.S. 1,11,24 USPQ 26, 36 (1935). 

D. District Court's Decision 

By Memorandum and Order dated May 
30, 1980, the district court separated the 



issues of infringement and invalidity from 
other defenses and counterclaims. On Febru- 
ary 11, 1983, after trial without a jury, the 
district court found no infringement of "the 
patent." Because the finding could apply 
properly only to the six claims at issue, we 
review the decision as if only the six claims 

were involved. . F. Supp 219 

USPQ 83 (D. Mass. 1983). The district court 
did not resolve the validity issue, noting that 
the patent had expired. If infringement dur- 
ing the period before expiration be found, the 
validity issue would of course become viable. 

The district court held that the claims of 
the Fromson patent, when read in light of the 
specification and prosecution history, must be 
interpreted as requiring that the water-in- 
soluble, hydrophilic, organophobic layer be 
the product of a chemical reaction between 
the aluminum oxide coating and the alkali 
metal silicate,- and that that product be a 
compound having physical properties differ- 
ent from those of its constituents. Finding no 
such reaction in preparation of Advance's 
plates, the district court entered a judgment of 
noninfringement in the four consolidated 
actions. 

The district court considered unpersuasive 
three laboratory tests and testimony of From- 
son s expert, based on those tests, that Ad- 
vance's plate haii a surface layer resulting 
from reaction of sodium silicate and anodized 
aluminum. It considered persuasive the tests 
and testimony of Advance's expert, who con- 
cluded that there is no reaction product of 
aluminum oxide and silicate on Advance's 
plate, but that the surface is coated with 
silica, an insoluble, hydrophilic material. 
Resolution of that conflicting testimony is not 
necessary, however, where, as here, a legal 
conclusion establishes its irrelevance. Indeed, 
we accept for this opinion that the Advance 
plate involves the formation of silica. 

The district court made no finding on 
whether silica is organophobic. In response to 
request for admission number 29, Advance 
stated that it employs a sodium silicate coat- 
ing believed hydrophilic and organophobic. 
The district court found that silica (not sodi- 
um silicate) is formed on Advance's plate. On 
remand, a finding of infringement will re- 
quire a determination of whether the layer on 
Advance's plates is water insoluble, hydro- 
philic and organophobic. 

The district court made no finding on the 
identity of the layer produced by the process 
described in the Fromson patent as the pre- 
ferred process embodiment (i.e., 1 55-21 0°F. 
and 1-10 minutes of treatment time). At trial 
Advance's expert said that the Fromson pro^ 
cess does not produce aluminosilicate within 
those time and temperature parameters. He 
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also said/ however, that although silica was 
first formed when he treatei.aluminum oxide 
crystals v with an ac-ueous solution of ah alkali 
metal silicate within those parameters, alum- 
inosilicate, could form after treatment lasting 
one hour. , 

E. Reissue Proceeding 

On March 29, 1979, after the four suits 
were -filed; Fromson filed : for reissue in the 
Patent and Trademark Office ("PTO") un- 
der a procedure which then allowed for re- 
issue proceedings and reexamination in light 
of new prior art. Advance participated as a 
protestor in accordance with PTO rules. Be- 
fore the judgment in the district court was 
rendered, the PTO again concluded that 
Fromson's claims were patentable. 

/ritttf ■" 

Whether the district court erred in finding 
no infringement or contributory infringement 
of claims 1, 4, -6, 7, 12, and 16. 

. Opinion 

[1,2,] The issue of infringement raises at 
least two questions: (1) what is patented, and 
(2) has what is patented been made, used or 
sold by another. SSlH Equipment S.A.'v. 
USITC, 713 &2d 746, 758, 218 VSPQ 678, 
688 (Fed. Gir. J983), The first is a question 
of law; the second a question of fact. Id.; 
Kalman v, Kimberly-Clark Corp.; 713 F.2d 
760, 771, 218 USPQ 781, 788, 789 (Fed. Cir. 
1983). The present decision of the district 
court turned on the first question, which we 
review as* a matter of law. 

A. Contentions of the parties \- 

w Advance and its customers contend that 
"the Court was totally convinced, based on 
the evidence presented, that Fromson was 
indeed referring to and claiming a 'reaction 
product* formed by the reaction of an alumi- 
num oxide with sodium silicate, i.e., an 
aluminosilicate compound." We agree that 
the district court so interpreted the claims. 

Fromson argues, that "reaction" in the 
claims should be interpreted to cover the 
claimed treatment of an oxide coated alumi- 
num sheet with an aqueous solution of alkali 
metal silicate to, form a water insoluble, hy- 
drophilic, organophobic layer on the sheet, 
and that whether the layer is an aluminosili- 
cate compound is irrelevant, there being no 
reference to any such compound in the assert- 
ed claims. We agree. 



219 USPQ 



B. Claim Construction — The Specification 

[3] In Autogiro Co. of America v. United 
States, 384 F.2d 391, 397, 155 . USPQ 697, 
702 (Ct. CI. 1967), our predecessor court 
recognized that patentees are not confined to 
normal dictionary meanings: 
The dictionary does not always keep 
abreast of the inventor. It cannot. Things 
are not made for the sake of words but 
words for things. To overcome this lag, 
patent law allows the inventor to be his 
... own lexicographer. (Citations omitted.) 
A patentee's verbal license "augments the 
difficulty of understanding the claims," and to 
understand their meaning, they must be con- 
strued "in connection with the other parts of 
the patent instrument and with the circum- 
stances surrounding the inception of the pat- 
ent application;" Id. Accord, General Electric 
Co. v. United States, 572 F.2d 745, 751-53. 
198 USPQ 65, 70-73 (Ct. CI: 1978). 

This appeal hinges on construction of "re- 
action," The specification discloses a new arid 
improved method of forming plates for use in 
lithography. Fromson discovered that the 
treatment of ariodized aluminum with an 
aqueous solution of water soluble alkali nietal 
silicate produces a water insoluble, hydro- 
philic, organojphobic layer on the aluminum, 
a layer having exceptional lithography-relat- 
ed properties. Fromson's invention included 
the formation of the layer, not its exact struc- 
ture. Though Fromson referred to the dis- 
closed treatment as involving a "reaction/' he 
also referred to it in the specification as an 
"application" and as "adsorption." Not all 
references to "reaction" were accompanied by 
a reference to formation of an aluminosilicate. 

[4] Fromson did theorize that his new, 
improved layer was an aluminosilicate be- 
lieved "to be in the nature of a commercial 
zeolite," having "properties of a molecular 
sieve," but expressed that theory as merely a 
"belief." There is no. basis or warrant for 
incorporating that belief as a limitation in the 
claims. It is undisputed that inclusion of 
Fromson's theory and belief was unnecessary 
, to meet the enablement requirement of 35 
U.S.C. §112 (that a patentee describe how to 
make and use the invention). Moreover, it is 
axiomatic that an inventor need not compre- 
hend the scientific principles on which the 
practical effectiveness of his invention rests. 
See, eg'., Diamond Rubber Go. v. Consolidat- 
ed Rubber Co., 220 U.S. 428, 435-36 (1911). 

C Claim Construction — The Claims 

[5] Significant evidence of the scope of a 
particular claim can be. found on review of 
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other claims. General Electric v., United 
States, supra, 572 F.2d at 752, 198 USPQ at 
70. Here, claim 5 (not asserted) limits the 
layer described in claim 1 to "an aluminosili- 
cate structure in the nature of a zeolite molec- 
ular sieve", i.e., to Fromson's theory of what 
is formed. In Kalman v. Kimberly-Clark 
Corp., supra, 713 F.2d at 771, 218 USPQ at 
788, this court said "where some claims are 
broad and others narrow, the narrow claim 
limitations cannot be read into the broad 
whether to avoid invalidity or to escape inr 
fringemeht." Accord, Environmental Designs, 
Ltd. v. Union Oil Co. of California, 713 F.2d 
693, 700, 218 USPQ 865, 871 (Fed, Cir. 
1983); Caterpillar Tractor Co. v. Berco, 

S.P.A., 714. F.2d 1110, , 219 USPQ 

185, 188 (Fed. Cir. 1983). The aluminosili- 
cate limitation of narrow claim 5 cannot, 
therefore, be read into broaderxlaim i. 

[6,7,8] Advance and , its , customers argue 
that, had appellant originally sought product 
by process claim's,' those claims would have 
been disallowed because each claimed element 
would have been old, citing the district court's 
finding 18 containing "combination of old 
element" terminology. This argument is 
without merit. First, .it is a hypothetical. That 
a process limitation appears in a. claim does 
not convert it to a product by process claim. 
Second, ;there is no basis for; treating combina- 
tions of old elements differently in determin- 
ing patentability. The analysis under 35 
U.S.C. §103 for any claimed invention re- 
quires a legal determination of whether the 
claimed invention as a whole would have 
been obvious to one of ordinary skill in the art 
at the time it was made. See, e.g., Environ- 
mental Designs, Ltd. v. Union Oil of Califor- 
nia, supra; Stratoflex, Inc. v. Aeroquip Corp.. 
713 F.2d 1530, 218 USPQ 271 (Ftd. Cir. 
1983). 

■ * , ■ ■ ■ ■ > " , 

D. Claim Construction — Prosecution 
History 

The district court noted some of Fromson's 
arguments during prosecution of his applica- 
tion, in which he stressed the importance of 
"reacting" anodized aluminum with alkali 
metal silicate. However, whether the interac- 
tion of these two materials was a "reaction" 
or something else was immaterial to consider- 
ation of the prior art, It does not appear, 
moreover, that Fromson used "reacting" in 
his arguments any differently than he had in 
the specification and claims, i.e., to describe 
what he believed the interaction was between 
oxide coated aluminum and an aqueous solu- 
tion of alkali metal silicate. Thus, Fromson's 
arguments focused on the fact of an interac- 



tion and production of a new layer with 
particular properties, not on the specific na- 
ture of the interaction or on any chemical 
structure of the layer. 

That Fromson speculated, on one page of a 
response to a rejection, that the reaction layer 
is ■ believed to be in the nature of a commer- 
cial zeolite" is of no moment, in view of the 
total absence from the other thirteen pages in 
that response of any reference to formation of 
an aluminosilicate or zeolite, and in view of 
his clear labeling of the zeolite statement as a 
"belief." Instead, Fromson referred in those 
thirteen pages to an aluminum oxide-sodium 
silicate reaction surface or layer, indicating 
that he did not know, and did not care, what 
the "reaction" or the structure of the resulting 
product might be. 

[9] Advance and its customers argue that 
the "prime issue" in this appeal is the doc- 
trine of prosecution history ("file wrapper") 
estoppel. That doctrine, however, is inappli- 
cable here. If there be literal infringement, 
direct and contributory (as there may be here 
under a proper construction of the claims), 
the doctrine is irrelevant. Even if it were 
applicable, an examination of a prosecution 
history demonstrating that "reaction" was 
merely a theoretical label having no influence 
on the patentability of the claimed invention 
as a whole demonstrates the absence of pros- 
ecution history estoppel. 



. E. . Claim Construction 
Considerations 



Other 



That "reaction" in the claims need not be 
confined to production of an aluminosilicate is 
consistent with the dictionary definition. That 
in Webster's New Collegiate Dictionary 
(1974) includes both "chemical transforma- 
tion or change," and "interaction of chemical 
entities," which are consistent with the defini- 
tions appearing in Hackh's Chemical Dictio- 
nary (1969) and the American Heritage Dic- 
tionary (1970); 

. When an oxide coated aluminum surface is 
; contacted with an aqueous solution of water 
soluble alkali metal silicate, chemical change 
occurs in at least two ways. First, ions or 
other chemical units in solution have some- 
how interacted to form a solid structure. Sec- 
ond, the water insoluble solid structure, what- 
ever may be its precise nature (e.g., silica or 
aluminosilicate), is not identical to the water 
alkali metal silicate and oxidized aluminum 
that interacted to produce it. Moreover, there 
is clearly present an "interaction of chemical 
entities. 

[10] The foregoing is fully consistent with 
long-standing use of "reaction" in the lithog- 
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raphy art. Claims are normally construed as 
they would be by those of ordinary skill in the 
art. See e.g., Schenck v. Noirtron Gorp., 713 
F.2d 789, 786-787, 218 USPQ 698, 701-02 
(Fed. Cir, 1983). Jewett interchangeably uses 
terms such as "treating," . "treatment, and 
"react," to describe a lithographic plate pro- 
ducing process. Jewett's claims use "react- 
ing," "treatment," and "reaction product." 
Jewett makes no attempt to define the struc- 
ture of the layer there disclosed (as an alu- 
minosilicate compound or otherwise), al- 
though it does mention the hydrophilic layer 
as being chemically bonded to the aluminum 
surface. Jewett refers to the layer as "silicate 
treatment," as "silicate or silicon containing" 
film, or as "an inorganic material such as 
silicate." It is not unreasonable to conclude 
that one of ordinary skill in the lithography 
art would interpret "react" in Frpmsonlb 
mean the same thing it appears to mean in 
Jewett, i.e., the treatment of a metal substrate 
with an aqueous solution to yield a layer, 
regardless of the chemical structure of the 
layer or the proper label for the phenomena 
that produced it. 

Conclusion 

We hold, therefore, that the district court 
erred as a matter of law in interpreting the 
claims as limited to the product of a chemical 
reaction producing a new chemical compound 
in the restrictive sense of those terms. There 
having been no finding by the district court 
that the accused plates have a layer that is 
water insoluble, hydrophilic and organopho- 
bic, the determination of infringement and 
contributory infringement must await com- 
plete findings in the first instance by the 
district court. 

Decision 

The four judgments based on findings of 
noninfringement of Fromson's asserted claims 
are vacated, and the cases are remanded for 
further consideration consistent with this 
opinion. 

Vacated and remanded. * 
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Court of Appeals, Federal Circuit 

The Young Engineers, Inc. (aka TYE or 

TYE, Inc.,) 
v. United States International Trade 

Commission 

No. 83-649 
Decided Nov. 8, 1983 

UNFAIR COMPETITION 

1. Importation restrained under Tariff Act 

(§68.60) 

Exclusion order and cease and desist order 
are alternative remedies. 

2. Importation restrained under Tariff Act 

(§68.60) 

There is no authority in 19 USC 1337(h) 
to support action of ITC that changed its 
orders after twelve-month period. 

3. Importation restrained under Tariff Act 

(569.60) 

While Commission determinations are not 
final for purposes of appeal to CAFC until 
review period has run, they are otherwise 
effective upon publication in Federal Regis- 
ter, 19 USC 1337(g)(2); during Presidential 
review period, products are in fact- excluded 
from entry except under bond; thus, Commis- 
sion orders are "effective"* before they are 
"final," not "final" and then "effective." 

4. Importation restrained under Tariff Act 

(§68.60) 

19 USC 1337(h) relates to order in effect 
and how long it will "continue in effect"; 
moreover, this section recognizes that order 
disapproved by* President under Section 
1337(g) is no longer in effect at earlier date; 
there is no authority in Section 1337(h) to 
issue exclusion orders but only to end effec- 
tiveness of outstanding order where conditions 
that led to such exclusion from entry or order 
no longer exist; 5 CFR 211.57 does not pro- 
vide greater authority than statute. 

5. Importation restrained under Tariff Act 

(§68.60) 

"Determination" in 19 USC 1337(b) 
means only conclusion that there is violation 
or no violation of Section 1337(a); although 
word "determination" appears in other para- 
graphs of Section 1337, its meaning is con- 
fused rather than clarified by comparison of 
various clauses in which it appears. 

6. Importation restrained under Tariff Act 

(§68.60) 

19 USC 1337(b)(1) time limitations are 
satisfied if Commission completes its investi- 
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utile per' inutile non-vitialtur might ap- 
propriately be applied here. 

Court where the verbiage of synergism is 
used. 1 The maxim utile* per inutile non 
yitiatur might appropriately be applied here. 

Moreover; T should have been content to 
continue the present ambiguous stance with 
respect to synergism described in note 17 of 
the majority opinion,, rather than to stand 
up and be counted in the current synergism 
controversy. 2 

Perhaps the word synergism should be 
discarded , (except for its original meaning 
with respect to the interaction of chemicals 
or drugs, and for its use as a fashionable fad 
in television commercials, and for its 
theological and scriptural overtones). 3 But 
abandoning the verbal trappings and 
11 rhetoric of synergism" must not cause 
courts to overlook the importance of the re- 
quirement of novelty and invention, long 
required by the patent statutes and the 
Constitution. 

As/ pointed out in John Deere itself, 
Congress may nbt "enlarge the patent 
monopoly, without regard to the innovation 
advancement or social benefit gained 
thereby. . Moreover, Congress may not 
authorize the issuance of patents whose 
effects are to remove existent knowledge 
from the public domain, or to restrict free 
access to materials already available. In- 
novation, advancement, and things which 
add tp the §um of useful knowledge are in- 
herent requisites in a patent system which 
by constitutional command must 'promote 
the Progress of * * 1 * useful Arts/ This is 
the standard expressed in the Constitution 
! and it may riot be ignored. " 383 U.S. at 6, 
148 USPQ at 462. 

' - Similarly, this Court has said: "Thus, the 
.courts, in determining obviousness in a com- 
bination patent, must undertake the tripar- 
tite Graham inquiry without losing sight of 
the necessity to determine whether the 
device performs its function in an innovative 
fashion;" 608 F.2d at 91, 203 USPQ at 
.965-966, . 

It must never be forgotten that, the power 
given to Congress by Art. I, sec. 8, cl. 8 of 
the Constitution is "To promote the Progress of 
Science and useful Arts, by securing for 
limited Times to Authors and Inventors the 
exclusive Right, to. their respective Writings 
and Discoveries" [Italics supplied] The 
primary policy of the patent laws is to 
promote invention for the benefit ; of the 
public. The private gain enjoyed by the 
patentee is secondary; the "exclusive Right" 



conferred by the patent monopoly is merely 
the means of accomplishing the intended 
result of advancing the growth of science by 
adding to the sum of human knowledge. A 
patent cannot be sustained which would 
withdraw or subtract from what is already 
known and practiced. Borden Co. v. Clear- 
field Cheese Co., 244 F.Supp. 366, 368, 146 
USPQ 660, 661 (W.D. Pa. 1965). To fence 
in by a newly created monopoly elements 
previously available to the public (by 
aggregating them in a combination patent 
without any inventive innovation) would be 
contrary to public policy and fundamental 
principles of patent law. 

To emphasize the importance . of these 
constitutional aspects of bur patent system, 
whether or not they are clothed in "the 
rhetoric!' of synergism," it seemed proper to 
dwell upon them specifically in this con- 
curring opinion when joining in the judg- 
ment of the Court. '. 



Court of Customs and Patent Appeals 



In re Rasmussen 

No. 81-516 * ' 

Decided June 4, 1981 

PATENTS 

1. Amendments to patent application — 

New matter (§13.5) 

Claims — Specification must support 
(§20.85) 

35 U.S.C. 132 prohibits introduction of 
new matter into disclosure of application; 35 
U.S.C. 112, first paragraph, requires that 
claim language be supported in specifica- 
tion. 

2. Amendments to patent application — 

New matter (§13.5) 

Claims — Broad or narrow — In 
general (§20.201) 

Specification — Claims as disclosure 
(§62.3) 

Broadening claim does not add new 
matter to disclosure; disclosure is that 
which is taught* not that which is claimed; 
original claim is part of disclosure at time of 
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filing; consideration of original claim as 
evidencing support in disclosure for later 
submitted claims does not warrant employ- 
ment of 35 U.S.G. 132 as basis for rejection 
of- later submitted claims on ground that 
latter are adding new matter to original 
claim portion of disclosure; to do so would 
render Section 132 redundant in light of 
Section 112, first paragraph; applicant is en- 
titled to claims as broad as prior art and his 
disclosure will allow. 

*3. Amendments to patent application — 
New matter (§13.5) 

Claims — Specification must support 
(§20.85) 

Pleading and practice in Patent Office 
. — Rejections (§54.7) 

Proper basis for rejection of claim amend- 
ed to recite elements thought to' be without 
support in original disclosure is 35' U.S.G. 
112, first paragraph, not Section 132; latter 
section prohibits addition of new matter to 
original disclosure; it is properly employed as 
basis for objection to amendments to 
abstract, specifications, or drawings attemp- 
ting to add new disclosures to that originally 
presented; past opinions of Court of 
Customs and Patent Appeals, in cases in 
whiph. Section 132 claim rejection was 
revievved on Section 112 analysis, should not 
in future be viewed as having approved 
employment of Section 132 as basis for 
claim rejection; amended claims involved in 
those cases should have been rejected under 
Section 112, first paragraph; claim rejec- 
tions in those cases could then have been ex- 
plicitly affirmed or reversed on. direct 
applications of Section 112, rather than on 
Section 112 analyses applied to Section 132 
rejections; similarly, rejections of claims for 
lack of support when required in reissue 
applications should be made under Section 
112, first paragraph, rather than under new 
matter prohibition of 35 U.S.C. 251; accor- 
dingly, such cases are overruled insofar as 
they approved rejection of claims under Sec- 
tion 132. 

4. Claims — Broad or narrow* — In 
general (§20.201) 

Reissue — In general (§58.1) 

Fact that claim may be broader than 
specific embodiment disclosed in specifica- 
tion is in itself of no moment; statutory 
provision for broadened claims in reissue 
applications is intended to meet precisely 
situation in which patentee has claimed less 
than he had right to claim. 



5. Pleading and practice in Patent Office 
— In general (§54.1) 

Specification — Sufficiency of dis- 
closure (§62.7) 

35 U.S.C. 112 requires disclosure of only 
one mode of practicing invention; insistence 
upon boilerplate recitation in specification 
that specific embodiment shown was not 
meant to limit breadth of claims, or that ex- 
ample given was only one of several methods 
that could be employed, is exaltation of form 
over substance. 



;. Appeal from Patent and Trademark Of- 
fice Board of Appeals. 

Application for reissue of patent of Max 
Otto Henri Rasmussen, Serial No. 884,775, 
filed Mar. 8, 1978, for reissue of Patent No. 
3,963,549, issued June 15, 1976. From deci- 
sion affirming rejection of claim 6, applicant 
appeals. Reversed; Nies, J., dissenting. 

George Vande Sande, Washington, D.C., 
for appellant. 

Joseph F. Nakamura (Robert D. Edmonds, 
- of counsel) for Patent and Trademark Of- 
fice. 

Before Markey, Chief Judge, ' and Rich, 
Baldwin, Miller, and Nies, Associate 
Judges. 

Markey, Chief Judge. 

The decision of the Patent and 
Trademark Office Board of Appeals (board) 
a/firming the rejection of claim 6 under 35 
USC 132 is reversed. . 

Background 

Appealed claim 6 is contained in reissue 
application S.N. 884,775 filed March 8, 
1978.' Original claim 6 in the reissue 
application was directed to a method of 
manufacturing a thermal insulating 
member. 2 Rasmussen described in his 
specification the steps of applying adhesive 
to one side of a tubular plastic film, winding 
the film around two spaced drums, and, 
when the desired number of layers have 
been wound, cutting the film layers 



.' That application seeks reissue of U.S. Patent 
3,963,549, issued June 15, 1976. 

1 6. A method of manufacturing a thermal in- 
sulating member from a thin film of plastic 
material and comprising a pair of spaced op- 
posing generally parallel sidewalls- which are 
bridged by a plurality of spaced transverse walls 
comprising the steps of:! 
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transversely to the direction of winding. 
When the film units are unfolded and ex- 
tended, a plastic laminate in one of the 
forms shown below results: 




Viewing the particular method of adher- 
ing the layers of tubular film as immaterial, 
Rasmussen later amended claim '6, inter 
alia, by substituting "adheringly applying'* 
for language specifying use of adhesives. 3 



winding a continuous length of a tube formed of 
the plastic material in its flattened state into a 
generally cylindrical member whose cir- 
cumference corresponds to the desired width of 
the insulating member, 

applying to the flattened ' tube a band of ad- 
hesive of predetermined width corresponding sub- 
stantially to the width of said spaced transverse 
walls and thus to the desired spacing between 
,,■ the sidewalis ~ said application of adhesive oc- 
curring prior to the contacting of the flattened 
tube during the winding step to the portion of 
the tube already wound onto the cylindrical 
member so that successive ' layers of said 
tubular member on said cylindrical member 
adhere to each other along the predetermined 
width of adhesive application, 

terminating the winding of the plastic tube onto 
the cylindrical member when' a predetermined 
number of layers of the plastic tube has been 
wound thereon corresponding to the desired 
length of the thermal insulating member, 

and cutting the superimposed assemblage of 
successively adhering layers of the plastic tube 
in a direction transverse to the longitudinal 
direction of the tube, 

said assemblage when longitudinally 
straightened and then extended in a direction 
transverse to the longitudinal direction of the 
tube so as to expand the successively joined 
tubes forming said thermal insulating member 
[Emphasis added.] 

1 Amended claim 6 reads as follows: ' 
6. A method of manufacturing a thermal in- 
sulating member from a thin film of material 
and comprising a pair of spaced opposing 
generally parallel sidewalis which are bridged 
by a plurality of spaced transverse walls com- 
prising the steps of: 

winding a continuous length of a tube formed of 
the material in its flattened state onto a 
generally cylindrical member whose cir- 
cumference corresponds to the desired width of 
the insulating member, each successive layer of 



The examiner rejected the amended 
claim, saying "adheringly applying" was 
new matter 7 ' prohibited by §132, explain- 
ing that limitation of the scope of the 
original disclosure to use of adhesives meant 
that allowance of the broader claim would 
be an, enlargement of the scope of the dis- 
closure. 

The board affirmed, saying Rasmussen 's 
application disclosed only one embodiment 
(applying adhesive to join, the sheets) and 
that broadening the scope of the claim add- 
ed new matter to the application. 

Issue 

The issue presented is whether amended 
claim 6 was properly rejected under 35 USC 

Opinion 

[1] Confusion is generated when related 
but distinct statutory provisions are treated 
as interchangeable. Section 132 prohibits 
the introduction of new matter into the dis- 
closure of an application. Section 112, first 
paragraph, requires that claim language be 
supported in the specification. This court, 
having said that a rejection of an amended 
claim under § 132 is equivalent to a rejection 
-under § 112, first paragraph, for lack of sup- 
port, appears to have contributed to the 
treatment of those separate statutory sec- 
tions as interchangeable. See In re Hogan, 
559 F;2d 595, 608, 194 USPQ 527, 539 (CC- 



said tube being wound to overlie the im- 
mediately preceding layer to provide thereby a 
generally cylindrical band of tubular layers ex- 
tending axially along said cylindrical member a 
distance corresponding substantially to the 
flattened width of said tube, 

adheringly applying the flattened tube during 
the winding step to the portion of the tube already 
wound onto the cylindrical member over a band 
of predetermined width corresponding substantial- 
ly to the desired width of said spaced transverse 
walls and thus to the desired spacing between 
said sidewalis, 

terminating the winding of the tube onto the 
cylindrical member when a predetermined 
number of layers of the tube has been wound 
thereon corresponding to the desired length of 
the thermal insulating member, 

and cutting the superimposed assemblage of 
successively adhering layers of the tube in a 
direction transverse to the longitudinal direc- 
tion of the tube, 

said assemblage when longitudinally 
straightened and then extended in a direction 
transverse to the longitudinal direction of the 
tube so as to expand the successively joined 
tubes forming said thermal insulating member. 
(Emphasis added.) 
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PA 1977), In re Wertheim, 541 F. 2d 257, 
265, 191 USPQ 90, 99 (CCPA 1976), In re 
Bowen, 492 F.2d 859, 864, 181 USPQ 48, 52 
(CCPA 1974), In re Smyth, 480 F.2d 1376, 
1385, 178 USPQ 279, 286 (CCPA 1973). 4 . 

Apparently reluctant to reverse on the 
sole ground that an improper statutory 
provision had been employed, and recogniz- 
ing the burden on the parties inherent in a 
return of the case for application of §112, 
this court has reviewed §132 claim rejec- 
tions on the basis of whether the rejected 
claim found support in the original dis- 
closure. See In re Eickmeyer, 602 F.2d 974, 
981, 202 USPQ 655, 662 (CCPA 1979); In 
re Barker, 559 F.2d 588, 593-94, 194 USPQ 
470, 474 (CCPA 1977); In re Winkhous,'527 
F.2d 637, 640, 188 USPQ 129, 131 (CCPA 
1975). 

Similarly, new matter rejections of claims 
under §251 have been reviewed oh the basis 
of a §312 analysis, that is* on whether a 
claim found support in an original patent. 
See In re East, 495 F.2d 1361, 1366, 181 
USPQ 716, 719 (CCPA 1974). 

As is illustrated, in the present case, 
Employment of §§132 and 112 as in- 
terchangeable leads to confusion of two dis- 
tinct concepts: (1) the adding of new matter 
to the disclosure; and (2), the broadening of 
a claim. 

[2] Broadening a claim does hot add new 
matter to the disclosure. Disclosure is that 
which is taught, not that which is claimed. 5 
An applicant is entitled to claims as broad 
as the prior art and his disclosure will allow. 

[3] The proper basis for rejedion of a 
claim amended to recite elements thought to 
be without support in the original dis- 
closure, therefore, is §312, first paragraph, 
not §132. The latter section prohibits addi- 
tion of new matter to the original disclosure. 
It is properly employed as a basis for objec- 



4 MPEP 706.03(o), 608.04-608.04(c), and 
3411.02 relate to considerations set forth herein. 

s We deal here with rejection of amended 
claims, and, by implication, with rejection of en- 
tire new claims submitted after filing. An original 
claim is part of the disclosure at the time of filing. 
In re Anderson, 471 F.2d .1237, 1238, 176 USPQ 
331, 332 (CCPA 1973). Consideration of an 
original claim as evidencing Support in the dis- 
closure for later submitted claims - does not 
warrant employment of §132 as a basis for rejec- 
tion of later submitted claims on the ground that 
the latter are adding new matter to the original 
claim portion of the disclosure. To so hold would 
render §132 redundant in light of §112, first 
paragraph. 



tion to amendments to the abstract, 
specifications, or drawings attempting to 
add new disclosure, to that originally 
presented. Past opinions of this court, ; in 
cases in which a §132 claim rejection was 
reviewed on a §112 analysis, should not in 
future be viewed as having approved the 
employment of §132 as a basis for claim re^ 
jection. The amended claims involved in 
those cases should have been rejected under 
§112, first paragraph/The claim rejections 
in. those cases could then have been explicit^ 
ly affirmed or reversed on direct 
applications of §132, rather than on §312 
analyses applied, to §132 rejections. 6 Accor- 
dingly, such cases are overruled insofar as 
they approved rejection of claims under 
§132. 

Turning to the merits of this appeal, we 
will again treat a § t32 claim rejection before 
us as though it had been made under §112, 
first paragraph. 1 We proceed to decide the 
case on that basis in the interest of judicial 
economy. Were we to merely reverse the re- 
jection as having been made under an inap- 
propriate statutory provision, and . say no 
more, the PTO would presumably enter a 
rejection under §112 . and that decision 
would then be appealable to ; this court, 

Amended claim 6 recites the adhering 
step as "adheringly applying" one layer of 
tube to an adjacent earlier layer. 
Rasmussen's specification describes that 
step as follows: " [Ajdhesive is applied to the 
tubular foil 4 in a narrow or broader strip, 
possibly in two narrow strips. Accordingly, 
the face of the tubular foil successively sticks . 
to the winding lying on the drums." The 
language of the specification thus describes 
one method of "adheringly applying" one 
layer to the other. - ■ " ■ 

[4] As above indicated, that a claim may 
be broader than the specific embodiment 
disclosed' in a specification is in itself of no 
moment. Indeed, the statutory provision for 
broadened claims in reissue applications is 
, intended to meet precisely the situation in 
which a patentee has claimed "less" than he 
had a right to claim. 35 USC 251. 

In In re Smythe, 480 F.2d 1376, 1384, 178 
USPQ 279, 285 (CCPA .1973), this court 
stated: 

A hypothetical situation may make our 
point clear. If the original specification of 



* Similarly, rejections of claims for lack of sup- 
port when required in reissue applications should 
be made under §112, first paragraph, rather than 
under the new matter prohibition of 35 USC 251. 
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Hedaya Brothers, Inc. v. Capital Plastics, Inc. 



a. -patent application on the scales of 
justice- disclosed only a 1-pound 'Head 
weight" as a counterbalance to determine 
the weight of a pound of flesh, we do not 
believe the applicant should be prevented, 
by. the :stf-called "description re- 
quirement "of the first paragraph of §1 12, 
or. the prohibition against new. matter of 
§132, from later claiming the counter- 
balance as a "metal weight" or simply as 
a 1-pound "weight" although both 
"metal weight" and "weight" would- in- 
deed be progressively broader than "lead 
weight," including , even such, an ..un- 
disclosed, but obviously art?recognized 
equivalent,, "weight" as a pound. of - 
feathers. The broader, claim language 
would be permitted because the description 
of the use and Junction of. the lead weight as a 
scale counterbalance in the whole disclosure 
would immediately convey to any person 
skilled in the, scale art the knowledge that 
the applicant invented a scale ; with a 1- 
pound counterbalance weight, regardless 
of ifs composition. 
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[5] Similarly, one skilled in the : art who 
read Rasmussen's specification would un- 
derstand that it is unimportant how the 
layers are adhered, so 'long as they are 
adhered. 7 Thus- the phrase "adheringly 
applying" is supported by the example 
found: in- the specification. • < 

Conclusion 

The phrase "adheringly applying" being 
supported in the specification, rejection of 
that claim under 35- USC 132, first 
paragraph, is reversed. Rejection under.the 
appropriate statutory provision, 35 USC 
112, would have been inappropriate. . 

Reversed 



The board seemed to realize that 35 (JSC 1 1 2 
requires disclosure of only one mode of practicing 
the invention, but nevertheless insisted upon a 
boilerplate recitation in the specification that the 
specific embodiment shown was not meant to 
limit the breadth of the claims, or that the exam- 
ple given was only one of several methods which 
could be employed. Such insistence is .here an ex-, 
altation of form over substance. 
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Hedaya Brothers, Inc. 
v. Capital Plastics, Inc. 

No. 79 Civ. 4104 
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COPYRIGHTS 

1. Infringement — Tests of (§24.209) 

* : Test for determining whether copyright 
-infringement hasr occurred is whether 
average lay observer would find substantial 
similarity: in designs, recognizing copy as 
appropriation of copyrighted work; plaintiff 
in copyright infringement action can only 
prevail if overall aesthetic impressions 
created by designs are substantially same. 

2. Infringement — In general (§24.201) 

Matter copyrightable — In general 
(§24.301) . 5 

In making determination of copyright in- 
fringement, it must be remembered that 
plaintiffs copyright does not protect idea of 
design, but only plaintiff's particular expres- 
sion of that idea; even if defendant has 
sedulously borrowed each of plaintiffs 
ideas, that alone is not violative of copyright 
statute. 

3. infringement— In general (§24.201) 

That ideas contained in design were ,old 
hat and originality of design was minimal 
must also be weighed in considering claim of 
infringement because, where basic design is 
not .original with plaintiff small variations 
by subsequent designers may protect them 
from charges of infringement* 

4. Infringement — Tests of (§24.209) 

Accused =: has not infringed plaintiffs 
copyrights where aesthetic appeals of par- 
ties' respective designs are not substantially 
same, and would not strike average lay 
observer, as being substantially same. 

5. In general (§24.01) 

Notice of copyright (§24.35) 

Copyright' infringement action is govern- 
ed by Copyright Act of 1909 where designs 
in question were created and published 
prior, to January 1, 1978, effective date of 
Copyright Act of 1976; under old Act, 
statutory protection was obtained only by 
publication of work with notice of copyright 
required by what was formerly 17 U.S.C. 
10, in form required by former 17 U.S.C. 19; 
while under saving provision of former 17 
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data. Another method much in vogue then 
was the use of the analytical ultracentrifuge, 
bv which, on the same time photograms 
used for measuring the sedimentation coef- 
ficient (by monitoring the movement of a 
boundary through Schlieren optics), one 
could also assess the diffusion coefficient 
( 18) 

More recent methods for measuring dif- 
fusion coefficients of macro molecules are 
based on dynamic laser light scattering 
(19). Clearly, all these methods require 
dedicated equipment and are not easily 
available to scientists in nonspecialized 
laboratories. The present method is based 
on commonly available equipment (an in- 
strument for capillary zone electrophoresis 
or just a capillary, pump, and UV detec- 
tor), now standard in most biochemical 
laboratories, and allows an easy and repro- 
ducible determination of D values for both 
small analytes and macromolecules. How- 
ever, a relatively large difference between 
D and D ta s f° r macromolecules indi- 
cates that slower flows should be used in 
such cases. 

The present method allows for a quick 
and precise estimation of the molecular 
diffusion coefficient and, thus, of the ra- 
dius of a molecule in a wide range of 
molecular mass values and might be useful 
in a larger number of chemical and bio- 
chemical laboratories than in the past. An 
additional advantage of this application of 
Taylor's approach is that only small vol- 
umes of solution (fractions of nanoliters) 
are required. 
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Synthesis of Proteins by Native 
Chemical Ligation 

Philip E Dawson, Tom W. Muir, lari Clark-Lewis, 
Stephen B. H. Kent* 

A simole technique has been devised that allows the direct synthesis of native backbone 
size. Chemoselective reaction of two unprotected peptide segments 
a ves an initial thioester-iinked species. Spontaneous rearrangement of th.s transient 
Mediate % ds ; a full-length product with a native peptide bond at the ligation site. The 
utHW of native chemical ligation was demonstrated by the one-step prepara ion .of 
cSine containing multiple disulfides. The polypeptide ligation product was folded and 
oSSd S native disulfide-containing protein molecule. Native chemical ligation 
S«?Sp«Snt step toward the general application of chemistry to proteins. 
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Proteins owe their diverse properties to the 
precisely folded three-dimensional struc- 
tures of their polypeptide chains. This is the 
defining feature of a protein, rather than 
size or molecular mass per se. Merely de- 
scribing the three-dimensional structure of 
a protein is insufficient to fully explain its 
biological properties. A better understand- 
ing of how structure dictates the biological 
properties of a protein would be achieved by 
systematically, varying the covaleht struc- 
ture of the molecule and correlating the 
effects with the folded structure and biolog- 
ical function. 

In this report, we describe an important 
extension of the chemical ligation method 
(I) to allow the preparation of proteins 
with native backbone structures. The prin- 
ciple of "native chemical ligation" is shown 
in Fig. 1. The first step is the chemoselec- 
tive reaction of an unprotected synthetic 
peptide-a-thioester (2, 3) with another un- 
protected peptide segment containing an 
amino-terminal Cys residue to give a thio- 
ester-iinked intermediate as the initial co- 



(1970). 

11 It is worth remembering that the theory and expen- 
' mental methodology for capillary zone electrophore- 
sis had been developed by the mid-1970s [S. 
Hjerten. Chromatogr. Rev. 9. 122 (1967); R. vir- 
tanen, Acta Pofytech. Scand. 123, 1 (1974)], al- 
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valent product. Without change in the re- 
action conditions, this intermediate under- 
goes spontaneous, rapid intramolecular re- 
action to form a native peptide bond at the 
ligation site. The target full-length polypep- 
tide product is obtained in the desired final 
form without further manipulation. We be- 
lieve that general synthetic access of this 
type will allow almost unlimited variation 
of the covalent structure "of the protein 
molecule. 

Model studies were undertaken with 
small peptides to investigate the native 
chemical ligation approach (4). These stud- 
ies were consistent with the mechanism 
shown in Fig. 1, in which the initial thio- 
ester ligation product was not observed as a 
discrete intermediate because of the rapid 
rearrangement to form a stable peptide 
bond. Facile intramolecular reaction results 
from the favorable geometric arrangement 
of the a-NHj moiety with respect to the 
thioester formed in the initial chemosejec- 
tive ligation reaction. The use of such "en- 
tropy activation" for peptide bond forma- 
tion is based on principles enunciated b\ 
Brenner (5) and more recently adopted b\ 
others (6). . - 

Study of a variety, of model pepuov 
established that native chemical ligation 
was generally applicable to peptides con 
raining the full range of functional group 
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normally found in proteins (7). As de- 
bribed in this report, native chemical liga- 
tion is limited to reaction at an amino- 
cerminal Cys residue. It was important to 
prevent the side chain thiol of this Cys from 
oxidizing to form a disulfide-linked dimer, 
because this was unreactive in the ligation, 
/^n excess of thiol corresponding to the 
cri jester leaving group was used to keep the 
Cys residues in reduced form without inter- 



fering with the ligation reaction. The ami- 
no-terminal peptide segment must be pre- 
pared by chemical synthesis to equip it with 
the necessary ot-COSR functionality 
(where R is an alkyl group) (2). Further- 
more, for optimal ligation, this component 
should have an unhindered (that is, non 
^-branched) carboxyl-terminal amino acid. 
Solubilizing agents such as urea or guani- 
dine hydrochloride did not interfere with 



the ligation and could be used to enhance 
the concentration of peptide segments, and 
thus increase the reaction rate. 

Further model reactions showed that 
the use of better thioester leaving groups 
resulted in faster ligation reactions. We 
applied this observation to the native 
chemical ligation of peptides from the ex- 
tracellular domain of a human cytokine 
receptor (8) (Fig. 2). Use of the 5-thio-2- 
nitrobenzoic acid (-SNB) leaving group, 
corresponding to the reduced form of El- 
man's reagent, gave rapid reaction in high 
yield. As described in the legend to Fig. 2, 
the reaction between the peptide segments 
was observed to have gone essentially to 
completion in less than 5 min, giving the 
50-residue product with a native peptide 
bond at the site of ligation. Thus, rapid 
native chemical ligation can be achieved 
by use of a thioester leaving group with 
suitably tuned properties. 

Application of the native chemical liga- 
tion method to the total synthesis of a 
protein molecule was illustrated by the 
preparation of human interleukin 8 (IL-8) 
(9). The 72-amino acid polypeptide chain 
contains four Cys residues, which form two 
functionally critical disulfide bridges in the 
native protein molecule (9). The total syn- 
thesis of IL-8 is shown in Fig. 3. The two 
unprotected synthetic peptide segments re- 
acted cleanly to give the full-length 
polypeptide chain in reduced form without 
further chemical manipulation (10). This 
successful ligation was particularly signifi- 
cant because the 33- and 39-residue IL-8 
segments each contained two Cys residues 
and together encompassed 18 of the 20 
genetically encoded amino acids found in 
proteins. The purified product was folded 
and oxidized as previously described (9) to 
give IL-8 with a mass precisely 4 daltons less 
than that of the original ligation product, 
indicating the formation of two disulfide 
bonds. The properties of this folded product 
were identical to those of authentic IL-8 
samples [II), This result unambiguously 
confirmed the formation of a peptide bond 
at the ligation site, because the thioester- 
to-amide rearrangement must have taken 
place to give the free Cys 34 side chain that 
formed the native disulfide bond (see Fig. 
3A). 

What is likely to be the impact of native 
chemical ligation on the study of proteins? 
Proteins are usually studied by expression in 
genetically engineered microorganisms with 
the methods of recombinant DNA-based 
molecular biology. Methods such as site- 
directed mutagenesis (12) have had a rev- 
olutionary impact on the ability to prepare 
large numbers of modified proteins in useful 
amounts for systematic study (13). Innova- 
tive approaches have increased the range of 
amino acids that can be incorporated in 



pig. 1. The principle of native chem- 
ical ligation. The synthetic segment, 
peptide 1. which contains a thioester 
at the a-carboxyl group, undergoes 
nucleophilic attack by the side chain 
of the Cys residue at the amino termi- 
nal of peptide 2 (R is an alkyi group). 
The initial thioester ligation product 
undergoes rapid intramolecular reac- 
tion because of the favorable geo- 
• metric arrangement [involving a five- 
membered ring] of the a -amino 
—up of peptide 2, to yield a prod- 
with a native peptide bond at 
the ligation site. Both reacting pep- 
tide segments are in completely un- 
protected form, and the target pep- 
tide s obtained in final form without 
fun. sr manipulation. 
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Fig. 2. Rapid native chemical ligation reaction, 
illustrated by the synthesis of a peptide segment 
corresponding to residues 46 to 95 from the ex- 
ternal domain of the human IL-3 receptor p-sub- 
unit (8). (A) Monitoring by ultraviolet (UV) absor- 
banca. Ligation was initiated by adding 
[Cy- "](77-95) to purified Msc(46-76)aCOSNB 
(27) at the stated pH and the reaction was moni- 
tored by UV [the substituted aryl thiolate leaving 
group has a characteristic UV absorbtion at 412 
nm (e. 



TNlB, 412 nm ' 



13.700 dm 3 mol"' cm' 1 )]. At 
pH 7.0. the reaction was essentially complete 
within 5 min. No reaction was observed when 
,. M sc(46-76)aCOSNB was exposed to a 10-fold 
. ' .ar excess of Leu-enkephaiin (amino-terminal 
residue, Tyr) at pH 5.0. This control experiment 
con;Vms the absolute requirement for an amino- 
terrr-nal Cys residue at the site of ligation. (B) 
Monitoring by HPLC. Purified [Cys 77 ](77-95) 
(0.96 mM) and (45-76)aCOSNB (0.9 mM) were 
r &acted in 8 M urea, pH 5.0. 50* mM ammonium 
acetate buffer at 23°C. Analytical HPLC (C l8 re- 
versed phase; 22.5 to 45% acetonitrile at 0.7% 
Per minute; monitored at 214 nm) of the individ- 
ual components is shown {upper trace). After 1 
hour, the ligation solution was exposed to the 
Educing agent tris(2-carboxyethyl)phosphine at 
o q ^ subsequently raised to pH 13 to 
f em: ve the Na-Msc moiety. Analytical HPLC, 
u ndfe t - the same conditions, of the crude product 
18 shown (lower trace). The 50-residue product 
^d the expected molecular mass by electrospray 
spectrometry [observed, 5747.0 daltons; 
calculated (average isotope composition), 5747.4 
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^tonsj. The ligation product was shown to be stable at high pH and to reducing conditions, and formed 
30 intramolecular disulfide. These observations are consistent with the presence of a native peptide bond 
at the site of ligation. 
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expression systems (14) and promise to sig- 
nificantly extend the utility of biosynthetic 
modification of the covalent structure of 
proteins. However, there appear to be lim- 
itations inherent to the nature of ribosomal 
protein synthesis (14). 

In favorable cases, chemical synthesis has 
already made important contributions to the 
exploration of the relationship of protein 
structure to function. Stepwise solid phase 
synthesis has permitted the de novo prepa- 
ration of small proteins (15), and there have 
been several notable examples of the use of 
this method of total protein synthesis to 
explore the molecular basis of biological 
function (16). Another method that has in 
special instances allowed chemistry to be 
applied to the study of proteins is semisyn- 
thesis through the conformationally assisted 
religation of peptide fragments (17). An im- 
portant extension of the semisynthesis ap- 
proach is the use of enzymatic ligation of 
cloned or synthetic peptide segments (18). 
Although these methods currently have se- 
vere limitations, there continues to be seri- 
ous interest in the wider application of the 
tools of organic chemistry to the study of 
proteins (15). 

Recently, we introduced the chemical li- 
gation of unprotected peptide segments as an 
improved route to the total synthesis of pro- 
teins (1). The key aspect of this approach 



was the use ot ^b^iftc^^ctire reaction to- 
specifically and unambiguously join peptides 
by formation of an unnatural (that is, non- 
peptide) backbone structure at the ligation 
site. It has permitted the facile preparation of 
a wide range of backbone-modified proteins, 
including analogs of protein domains (19) 
and of the human immunodeficiency virus- 1 
(HIV-1) proteolytic enzyme (J, 20). Chem- 
ical ligation has also proven to be useful for 
the routine, reproducible synthesis of large 
amounts of proteins in high purity with full 
biological activity (21), as welt as for the 
straightforward production of protein-like 
molecules of unusual topology (22). Howev- 
er, the range of proteins accessible by this 
technique is limited by the size of the syn- 
thetic peptide segments (23). A useful ex- 
tension would occur if we had direct synthet- 
ic access to native backbone polypeptide 
chains up to the size of typical protein do- 
mains (24). Chemical ligation would then 
allow us to string these domains together to 
explore the world of proteins in a general 
fashion. 

Native chemical ligation provides pre- 
cisely that capability. It combines the forma- 
tion of a native peptide bond at the ligation 
site with the advantages of chemoselective 
reaction of unprotected peptides (J). This 
second-generation ligation chemistry dra- 
matically increases the size of native back- 



fane polypeptides directly accessible by total 
'chemical synthesis (25). It can be usefully 
applied to a wide range of synthetic targets, 
including proteins of moderate size, and it 
allows direct access to protein functional 
domains (24). Native chemical ligation is a 
foundation stone of a general modular ap- 
proach to the total chemical synthesis of 
proteins (26). 
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Fig. 3. Synthesis of IL-8 by native chemical ligation. 
(A) Synthetic scheme leading to folded [Ala^JIL-S 
(2d). (B) Analytical HPLC (C 18 reversed phase; 25 to 
45% acetonitrile at 1% per minute; monitored at 214 
nm) of the synthetic peptide segments (29), 11-8(1 -33)aCOSBzl and IL-8(34-72), each shown before 
reaction was initiated (30). (C) Analytical HPLC under the same conditions of the purified ligation product, 
IL-8(1-72)(SH) 4 , in fully reduced form. (Inset) Electrospray mass spectrum (raw data displayed as a single 
charge state): observed molecular mass 8319.8 daltons; calculated molecular mass (average isotope 
composition), 8319.8 daltons. (D) Air oxidation of the purified 1-72 ligation product to form the folded 
[Ala 33 ]lL-8 molecule, shown after HPLC purification. The earlier elution of the folded, disulfide cross-linked 
native protein compared with, the reduced polypeptide is typical (9). Folding and oxidation conditions: 
polypeptide at 0.2 mg/ml, 1 M guanidine-HCI, pH 8.5 tris buffer, and vigorous stirring in air at ambient 
temperature. (Inset) Electrospray mass spectrometry of the oxidized and folded synthetic IL-8 (raw data 
displayed as a single charge state). Observed molecular mass, 8315.6 daltons; calculated molecular 
mass (average isotope composition), 8315.8 daltons. 



l) Chemical ligation JM. Schnolzer and S. B. H. Kent, 
*' Science 256, 221 (1 992)] involved the chemoselectwe 
reaction of unprotected peptides to give a product 
with an unnatural backbone structure at the ligation 
site. Use of unprotected peptides circumvented the . 
difficulties inherent to classical chemical synthesis, 
namely, complex combinations of protecting groups 
that lead to limited solubility of many synthetic inter- 
mediates [for example, K. Akaji ef a/., Chem. Pharm. 
Bui {Tokyo) 33. 1 84 (1 985)]. In contrast, the chemical 
ligation techpique has allowed us to make good use of 
me ability to routinely make, purify, and characterize 
unprotected peptides 50 or more residues in lenctn 
(23). 

2. Peptide 1 in Fig. 1 . The key feature is the -aCOS=l 
moiety, which is readily generated from the peptiae- 
aCOSH prepared by highly optimized stepwise sol- 
id-phase peptide synthesis on a thioester resin. The 
thioester resin was prepared by a generalized ver- 
sion (L. E. Canne and S. B. H. Kent! manuscript in 
preparation; details can be obtained from the au- 
thors) of the Blake- Yamashiro procedure (3). Peptide 
products were cleaved, purified, and characterized 
by standard methods {23). 

3. J. Blake, int. J. Pept. Protein Res. 17, 273 (1 981); 0. 
Yamashiro and C. H. U ibid. 31 . 322 (1988). 

4. To help explore the mechanism of the rea ction, t ; >9 
peptide Leu-Tyr-Arg-Aia-Gty-aCOSBzJ (3zl. ber.:>fl 
was reacted with Ac-Cys (Ac, acetyl). The exact mass 
of the resulting ligation product was determined by 
electrospray mass spectrometry and was consistent 
with a thioester-linked peptide as the ligation product 
generated by nucleophiiic attack ol the Ac-Cys side 
chain on the a-thioester moiety of the peptide. Reac- 
tion of Leu-Tyr-Arg-Ala-Gty-aCOSBzl with H-Cys- 
Arg*Aia-Glu-Tyr*Ser (containing an unblocked a-NHj 
functional group) proceeded rapidly at pH 6.8 (below 
pH 6, the reaction proceeded very slowly, suggesting 
tne involvement of the ionized thiolate form of the Cys 
side chain) and gave a single product of the expected 
mass. This product lacked susceptibility to nuclc> 
philes and had the ability to form disutfide-iinked 
dimeric peptides, indicating unambiguously the for- 
mation of a native amide bond at the ligation site. 

5. M. Brennet, in Peptides. Proceedings of the Eighth 
European Peptide Symposium, H. C. Beyerman, Ed. 
{North-Holland, Amsterdam. 1967). pp. 1-7. 

6. D. S. Kemp and R. I. Carey, J. Org. Chem. 58, 2216 

(1993) ; C.-F. Uu and J. P. Tarn, J. Am. Chem. Soc 
116.4149(1994). 

7. Even free internal Cys residues may be present in 
either of the reacting segments. Interna! Cys residues 
can undergo ester exchange with the peptide-a-th.> 
ester component; however, this reaction is unproo.-:- 
tive because no rearrangement to the amide bc~~ 
can occur; the thioester formed is readily reversion 
and remains a productive part of the reacting system. 

8. R. -D* Andrea et ai. Blood 83. 2802 (1994). 

9. M. Baggiolini and I. Clark-Lewis, FEBS Lett. 307. 97 
(1989); I. Clark-lewis, B. Dewald, M. Loetscher. B 

' Moser, M. Baggiolini, J. Biol. Chem. 269, 16075 

(1994) ; I. Clark-Lewis. Biochemistry 30. 3128 (1991) 
K. Rajarathnam, I. Clark-Lewis, B. 0. Sykes. ibid. 29 
1689 (1994). 

1 0. Analogous methods have required removal of P f 0' 
tecting groups (3, 6) or conversion of intermedial 
to the final form (6), or both steps. No previous me-'-fl 
od has allowed the chemical reaction of unproteci* 
peptide segments to directly yield a native backbo^ 
final product. 

1 1 . Titration in an assay for neutrophil eiastase releas- 
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(9) demonstrated that the potencies [median effec- 
tive dose {EDgJ = 0.3 nM] and maxima) responses 
of the folded, ligated [Ala^lll-S and the correspond- 
ing molecule obtained by conventional synthesis (9) 
were indistinguishable and identical to native se- 
quence IL-8. 

12. M. Smith, Angew. Chem. Int. Ed. Engl 33, 1214 
(1994). 

1 3. C. Bgenbrot and A. Kossiakoff, Curr. Op/n. Biotech- 
nol. 3,333(1992). 

! ;. C. J. Noren, S. J. Anthony-Cahill, M, C. Griffith, P. G. 
Schultz, Science 244, 182 (1989); J. A. EBman. 0. 
Mendel. P. G. Schultz. ibid. 255, 197 (1992); V. W. 
Cornish et a/.. Proc. Natl. Acad. Sci. USA. 91 . 2910 
(1994). 

1 5. T. W. Muir and S. B. H. Kent. Curr. Opin. Biotechnof. 

4. 420(1993). 

16. M. Miller et a/., Science 246, 1149 (1989); A. 
Wiodawer ef at., ibid. 245, 616 (1989); L. H. Huang, 
H. Cheng, A. Pardi, J. P. Tarn, W. V. Sweeney. Bio- 
chemistry 30. 7402 (1991); K. Rajarathnam ef aJ., 
Science 264,90(1994). 

1?. R. E. Offord, in Protein Design and the Development 
of New Therapeutics. and Vaccines, J. B. Hook and 

* G. Poste. Eds. (Plenum, New York, 1990). pp. 253- 
282: C. J. A. Wallace and I. Clark-Lewis. J. Biol. 
Chem. 267,3852(1992). 

18. I. Abrahmsen ef a/.. Biochemistry 30, 415 1 (1991); 
T. K. Chang. D. Y. Jackson, J. P. Burnier, J. A. Wells. 
Proc. Natl. Acad. Sci. U.SA., in press. 

19. Ugated 10F3, the integrin-binding module of fi- 
bronectin: 95 residues (M. Williams, T. Muir, M. Gins- 

N berg. S. B. H. Kent, J. Am. Chem. Soc. in press). 

20. Catalytic contribution of flap- substrate hydrogen 
bends in HIV- 1 protease explored by chemical syn- 
theses: homodimer of 99-residue subuntts [M. Baca 
and S. B. H. Kent. Proc. Natl. Acad. Sci. U.SA. 90, 
11638(1993)1. 

2 * . a C. deLisfe Milton. S. C. F. Milton, M. Schnolzer, S. 
B. H. Kent, in Techniques in Protein Chemistry IV 
(Academic Press, New York, 1992). pp. 257-267. 

22. Fcur-helix bundle template- assembled synthetic pro- 
tein: molecular mass 6647 daJtons (P. E. Dawson and 

5. 3. H. Kent. J. Am. Chem. Soc. 115. 7263 (1993)); 
homogeneous multivalent artificial protein: molecular 

• mass 1 9.9 1 6 daltons [K. Rose, ibid. 1 1 6. 30 (1 994)]; 
artificial neoprotein mimic of the cytoplasmic domains 
of a multichain iniegrin receptor: molecular mass 
14.194 daltons (T. W. Muir, M. J. Williams, M. H, 
Ginsberg, S. B. H. Kent, Biochemistry 33. 7701 
M994)]: peptide dendrimer: molecular mass 2^.205 
csitcns [C. Rao and J. P. Tarn. J. Am. Chem. Soc. 
116. 6975(1994)). 

23. Ey using optimized stepwise solid-phase methods 
(M. Schnolzer. P. Alewood. D. Alewood. S. B. H. 
Kent. Int. J. Pept, Protein Res, 40 ; 180 { 19921]. the 
preparation in good yield and high purity of peptides 
up to 60 residues is routine; in favorable cases, pep- 
tides with >80 residues can be prepared. 

A. L. Berman, E. Kolker. E. N. Trifonov, Proc. Natl. 
' Acad. Sci. U.S.A. 91. 4044 (1994). 
'dt. The carboxyl -terminal peptide segment or protein 
module could also be expressed by standard recom- 
binant DNA methods: provided the product con- 
tained an amino- terminal Cys residue, it could be 
reacted with the synthetic amino-terminal peptide- 
ttCOSR by using the native cnemical ligation der 
scribed here to give a product in which part of the 
protein had been derived from chemical synthesis 
and part from ribosoma! synthesis. 

26. A modular strategy for the total synthesis of proteins 
has been developed based on the convergent chem- 
ical ligation of unprotected peptides (L E. Canne. S, 
K. Burley, S. B. H. Kent, paper presented at the An- 
nual Meeting of the Protein Society, San Diego. July 
1 994). Protein domains (modules) were prepared by 
chemical ligation of 50 to 70 residue segments; these 
domains were then stitched together to give the target 
prciein. Mutually compatible ligation chemistries are 
recuired: Intradomain ligation should optimally yield a 
stable, peptide-tike bond, whereas interdomain liga- 
tion wHI tolerate a wider variation of properties of the 
structure formed at the ligation site. 

27. Crude synthetic Msc(46-76)aCOSH (Msc. 2(meth- 
ylsuifonyljethyloxycarbonyl] was converted to the 

• 5-tfiio-2-nitrobenzoic acid ester (-COSNB) by 



treatment with 5,5'-dithio-bis(2-nitrobenzoic acid) 
[10 equivalents (eq)l in 8 M urea. pH 4.0 50 mM 
ammonium acetate buffer. This thipester-contain- 
ing material was found to be completely stable 
below pH 6.0, and was readily purified by reversed- 
phase high-performance liquid chromatography 
(HPLC). 

28. [Ala^JIL-S was chosen as the synthetic target for con- 
venience; previous work had shown that this mutant 
IL-8 had full biological activity (9), and a supply of the 
Boc-Ala (Boc, butyloxycarbonyi) thioester resin was 
on hand for other applications. The folded structure 
shown is based on the x-ray structure of the IL-8 
monomer [E. T. Baldwin ef a/., Proc. Natl. Acad. Sci. 
U.S.A 88. 502 (1991)). 

29. The IL-8 peptide segments were prepared by opti- 
mized stepwise solid-phase synthesis {23) and were 
purified by reversed-phase HPLC and characterized 
by standard methods. Crude synthetic segment IL- 
8(1-33)aCOSH was converted to the thiobenzyl es- 
ter by reaction with benzyl bromide (15 eq) in 6 M 



guanidine-HCI. pH 4.6, sodium acetate buffer, prior 
to purification under standard reversed-phase HPLC 
conditions. 

30. The segments (1 -33)aCOSBzl (5.0 mg; 1.3 jtmol) 
and 34-72 (4.8 mg, 1 . 1 ^mof) were reacted in 0.5 ml 
6 M guanidine-HCI, pH 7.6, phosphate buffer at 
23°C in the presence of benzyl mercaptan (5 pj)]. 
After suitable reaction time (48 to 72 hours), a ligation 
yield of -60% was obtained. The product was puri- 
fied by reversed-phase HPLC and characterized by 
electrospray mass spectrometry. 

31. We gratefully acknowledge the assistance of T. 
Walters and M. Baca in the early stages of this 
work, of L. Canne in providing the thioester resin, 
of B. Dewald for some of the elastase release as- 
says, and of R. Simon and S. Clark for critical 
comments on the manuscript. Supported by fund- 
ing from NIH [GM48897-01 and GM48870-03 
(S.B.K.); GM 50969-01 (I.C.L.)]. 
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A Theropod Dinosaur Embryo and the Affinities 
of the Flaming Cliffs Dinosaur Eggs 

Mark A. Norell,* James M. Clark.t Dashzeveg Demberelyin, 

Barsbold Rhinchen, Luis M. Chiappe, Amy R. Davidson, 
Malcolm C. McKenna, Perle Aitangerel, Michael J. Novacek 

An embryonic skeleton of a nonavian theropod dinosaur was found preserved in an egg 
from Upper Cretaceous rocks in the Gobi Desert of Mongolia. Cranial features identify the 
embryo as a member of Oviraptoridae. Two embryo-sized skulls of dromaeosaurids, 
similar to that of Velociraptor, were also recovered in the nest. The eggshell microstructure 
is similar to that of ratite birds and is of a type common in the Djadokhta Formation at the 
Flaming Cliffs (Bayn Dzak). Discovery of a nest of such eggs at the Flaming Cliffs in 1923, 
beneath the Oviraptor philoceratops holotype, suggests that this dinosaur may have been 
a brooding adult. 



Dinosaur eggs are abundant in Upper Cre- 
taceous rocks of the Gobi Desert (1,2), but 
embryonic skeletons from these deposits are 
scarce. Definitive remains include numerous 
bird embryos (3) and a single fragmentary 
specimen of an omithischian hind limb (4). 
Because the definitive taxonomic identity of 
eggs requires the presence of identifiable em- 
bryonic remains within them, the identity of 
most egg types present in Upper Cretaceous 
beds in Mongolia has been unclear (J, 2). 

In 1993, a rich Upper Cretaceous fossil 
locality in the Gobi Desert was discovered 
(5). The site, Ukhaa Tolgod, is in the north- 
eastern Nemegt Basin, Omnogov Aimak, 
near the salt extraction settlement of Daus. 

M. A. Norell. J. M. Clark, L. M. Chiappe. A. R Davidson. 
M. C. McKenna. M. J. Novacek, Department of Verte- 
brate Paleontology, American Museum of Natural Histo- 
ry. Central Park West at 79th Street. New York. NY 
10024, USA. 

Dashzeveg D. and Barsbold R., Geological Institute. 
Mongolian Academy of Sciences. Ulaan Baatar 1 1 , Mon- 
golia. 

Perle A., Mongolian Museum of Natural History, Ulaan 
Baatar 46, Zaluchudiin Gudama-1 , Mongolia. 
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* Present address: Department of Biological Sciences, 
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USA. 



In addition to over 300 mammal and lizard 
skulls, 20 theropod skeletons (including sev- 
eral adult and juvenile oviraptorids), and 
many protoceratopsian and ankylosaurid di- 
nosaurs discovered at this locality, at least 
five types of eggs were found. Many of these 
were arranged in nests. One egg, from a 
heavily weathered nest, contains the nearly 
complete skeleton of an embryonic ovirap- 
torid dinosaur (Fig. I). Also among the bro- 
ken eggshell fragments in this nest were two 
tiny skulls (-5 cm long) of a dromaeosaurid 
theropod, one preserved with eggshell adher- 
ing to it (Fig. 2). 

The red sandstones of Ukhaa Tolgod 
probably belong to either the Djadokhta For- 
mation or the Barun Goyot Formation and 
lie 35 km east of the Barun Goyot type 
section (6). Limited studies indicate the 
presence of taxa typical of either or both the 
Barun Goyot and the Djadokhta formations 
(that is, Velociraptor, Mononykus, and the 
mammals Zalambdalestes, Bulgaanbaatar, 
Nemegtbaatar, and Catopsdis). These faunas 
are considered correlative with the Judith- 
ian North American land mammal age and 
the Campantan marine stage (7), although 
this con-elation is poorly constrained (8). 
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Chemically Synthesized SDF-la Analogue, N33A, Is a 

Potent Chemotactic Agent for CXCR4/Fusin/LESTR-expressing 

Human Leukocytes* 
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Stromal cell-derived factor (SDF) 1 is a potent che- 
moattractant for leukocytes through activation of the 
receptor CXCR4/Fusin/LESTR, which is a fusion co-fac- 
tor for the entry of T lymphocytotropic human immuno- 
deficiency virus type 1 (HIV-1). This CXCR4-mediated 
HIV-1 fusion can be inhibited by SDF-1. Because of its 
importance in the study of immunity and AIDS, large 
scale production of SDF-1 is desirable. In addition to 
recombinant technology, chemical synthesis provides 
means by which biologically active proteins can be pro- 
duced not only in large quantity but also with a variety 
of designed modifications. In this study, we investigated 
the binding and function of an SDF-la analogue, N33A, 
synthesized by a newly developed native chemical liga- 
tion approach. Radioiodinated N33A showed high affin- 
ity binding to human monocytes, T lymphocytes, as well 
as neutrophils, and competed equally well with native 
recombinant SDF-la for binding sites on leukocytes. 
N33A also showed equally potent chemoattractant activ- 
ity as native recombinant SDF-la for human leukocytes. 
Further study with CXCR4/Fusin/LESTR transfected 
HEK 293 cells showed that N33A binds and induces di- 
rectional migration of these cells in vitro. These results 
demonstrate that the chemically synthesized SDF-la an- 
alogue, N33A, which can be produced rapidly in large 
quantity, possesses the same capacity as native SDF-la 
to activate CXCR4-expressing cells and will provide a 
valuable agent for research on the host immune re- 
sponse and AIDS. 



Stromal cell-derived factor (SDF) 1 1 has been reported to be 
a primordial chemokine of the CXC subfamily and has multiple 
biological activities on a variety of cell types (1—4). SDF-1 was 
initially isolated as a T lymphocyte chemoattractant and was 
found to be active also on monocytes but not on neutrophils (1, 
3). However, its activity on neutrophils was subsequently 
shown by using Ca 2+ mobilization experiments (4). The SDF-1 
gene is located on chromosome 10 (5), while the genes for other 
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known CXC chemokines are located chromosome 4 (6, 7). 
SDF-1 is well conserved with only a single amino acid substi- 
tution between human and murirne molecules (1, 2). SDF-la 
and SDF- 1)3 were believed to be the result of differential splic- 
ing of a single gene, with SDF-la missing four amino acids in 
the carboxyl terminus. The CXCR4/Fusin/LESTR has been 
identified as one of the functional receptors for SDF-1 (3, 4), a 
member of the seven transmembrane spanning receptor super- 
family (8, 9). CXCR4 was initially cloned as an orphan receptor 
(10-12) and was later identified as a fusion co-factor for the 
entry of HIV-1 of the T lymphotropic strain (13). In addition to 
its activities on human leukocytes, SDF-1 has been shown to 
inhibit the fusion and replication of T lymphotropic HIV-1 in 
host cells bearing CD4 and CXCR4 (3, 4). Therefore, SDF-1 
plays a pivotal role in host immune system and its defense 
against infection. 

Rapid production of cytokines and chemokines is essential 
for structure -function studies and design of molecules with 
agonist or antagonist activities. The turbocharged peptide syn- 
thesis technology (14) and chemical ligation of peptide seg- 
ments ( 15) permit the synthesis of chemokines in large quan- 
tity. This approach was therefore utilized to synthesize SDF-la 
based on its great utility in studies of immune responses and 
infectious diseases. However, the thioacid peptide (SDF-loQ- 
33)-aCOSH) could not be readily generated on the standard 
a-thiocarboxylate resin (16) due to the presence of an Asn 33 
residue at the ligation site in SDF-la. Instead, alanine was 
coupled to the a-thiocarboxylate resin, then the mutant SDF- 
la(l-33, N33A) was synthesized. In this study, we report/that 
this SDF-la analogue, N33A, displays full biological activity 
through the activation of the receptor CXCR4. This analogue, 
which can be rapidly produced in large quantity in the absence 
of contaminating peptides, will prove to be an important tool in 
the study of host immunity and AIDS. 

MATERIALS AND METHODS 

Chemokines — Human recombinant SDF-la was purchased from Pep- 
roTech Inc. (Rocky Hill, NJ). Radioiodination of the SDF-la and the 
analogue N33A was performed by a lactoperoxidase-labeling procedure. 
The radioactive ligands were further purified by reversed phase HPLC. 
The specific activity of the radioiodinated chemokines was 2200 
Ci/mmol. 

Chemical Synthesis of SDF-la Analogue, N33A — Boc protected 
amino acids were obtained from the following sources: AnaSpec (San 
Jose, CA), Bach em (Philadelphia, PA), NovaBiochem (San Diego, CA), 
and Peptides International (Louisville, KY). Peptides were synthesized 
on a modified ABI430A instrument using in situ neutralization Boc 
chemistry protocols (17). C-terminal segments were prepared on 
-OCH2Pam resins (ABI, Foster City, CA). N- terminal segments were 
prepared on a-thiocarboxylate resin (16). Standard HF cleavage proto- 
cols were employed following N- terminal Boc removal and drying of the 
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Fig. 1. Binding of radiolabeled N33A to mononuclear cells and cross-competition with rhSDF-lo. Cells were incubated for 40 min at 
room temperature with 0.12 nM radiolabeled N33A in the presence of increasing concentrations of unlabeled ligand. The cells were then harvested 
by centrifugation through a 10% sucrose phosphate-buffered saline cushion and measured for the radioactivity. The data were analyzed with 
LIGAND program. Panel A, binding of 125 I-N33A to monocytes; panel B, binding of l2e I-N33A to T cells; panels C and D, displacement of 
12c I-SDF-lo£ binding to monocytes and T cells by unlableled ligands. 



resin. HPLC purification was performed on Rainin HPLCs (Woburn, 
MA) using Vydac C4 or Dynamax C4 columns with gradient elution (A, 
H 2 0, 0.1% trifluoroacetic acid; B, acetonitrile, 0.1% trifluoroacetic acid). 
Electros pray mass spectrometry was performed on a Sciex 
APIl(PE-Sciex). 

Ligation of the peptide segments was performed at 4 mM peptide 
concentration in 6 M guanidine, 0.1 M Tris, pH 7, in the presence of 33 
mM thiophenol (Fluka, Switzerland) at room temperature. Ligation was 
monitored by HPLC and was typically complete within 24 h. Ligation 
was followed by HPLC purification and lyophilization. After purifica- 
tion, the full-length peptide was reduced at 1 mg/ml in 8 M urea (Fluka), 
0.1 M Tris (Fluka), 5.37 mM EDTA ((Fluka), pH 8.6, in the presence of 
100 mM 2-mercaptoethanol (Fluka). Reduction occurs under a nitrogen 
atmosphere at 40 °C for 1 h. After complete reduction, the mixture was 
reconstituted with the same buffer at 0.2 mg/ml with 18.7 mM oxidized 
glutathione (Sigma). The solution was dispensed into a Spectrum Spec- 
tra/Por*7 dialysis membrane (Houston, TX) (M r cut-off, 3500) and the 
bag was placed in 1 liter of initial dialysis buffer of 8 M urea, 0.1 M Tris, 
1 mM EDTA, 3 mM 2-mercaptoethanol, 1.3 mM oxidized glutathione, pH 
8.6. Over a period of 2 days, 4 liters of 2 M urea, 0.1 M Tris, pH 8.6, was 
pumped into the vessel containing the dialysis bag (18). After lyophili- 
zation, the full-length peptide was oxidized at 1 mg/ml in 2 M guanidine 
HC1 (Fluka), 0.1 m Tris (Fluka), pH 8.6, at room temperature in the 
presence of air. Folding was complete after stirring overnight and was 
monitored by HPLC and mass spectrometry. 

Cells and Che mo taxis Assays — Human peripheral blood leukocytes 
were isolated from normal donors (National Institutes of Health Clin- 
ical Center Transfusion Department, Bethesda, MD) according to the 
established protocols in this laboratory for purification of monocytes 
(purity was >90%) (19), T lymphocytes (purity, >95% CD3 + ) (20) and 
neutrophils (purity, >9S%) (19). The chemokine receptor, CXCR4/fusin/ 
LESTR cDNA was isolated in this laboratory and was stably trans- 
fected into human kidney embryonic epithelial 293 cells (CXCR4/293 
cells) as described previously (21). 

Leukocyte migration was evaluated using a 48-well microchamber 
(Neuroprobe, Cabin John, MD) technique as described previously (19, 
20, 22). The migration of CXCR4/293 cells was also assessed by the 
48-well microchamber technique with the polycarbonate filters (10-/xm 
pore size) (21) precoated with collagen type I (Collaborative Biomedical 
Products, Bedford, MA). The results are expressed as the the chemo- 
taxis index (CI) representing the fold increase in the cell migration 
induced by stimuli versus control medium. All experiments were per- 
formed at least two times, and results from one experiment are shown. 
The statistical significance of the difference between migration in re- 



sponse to stimuli and control was assessed by Student's t test. 

Binding Assays with Radiolabeled SDF-la and N33A — Binding as- 
says were performed using a single concentration of 128 I-labeled che- 
mokines in the presence of increasing concentrations of unlabeled li- 
gands (19, 21). The binding data were analyzed with a Macintosh 
computer program LIGAND (P. Munson, Division of Computer Re- 
search and Technology, NIH, Bethesda, MD). In Scatchard plots, the 
binding data were analyzed with both "one-site" and "two-site" models, 
and only the one-site model better fit the curves obtained with either 
native leukocytes or CXCR4/293 cells. The rate of competition for bind- 
ing by unlabeled ligands was calculated with the following formula: 

% inhibition = 1 - (binding in the presence of unlabeled 

chemokine/binding in the presence of medium alone) X 100 (Eq. 1) 

RESULTS AND DISCUSSION 

We first examined whether the SDF-la analogue N33A was 
able to bind and activate human peripheral blood monocytes 
and T cells. Fig. 1 shows that 12S I-N33A specifically bound to 
human peripheral blood monocytes (Fig. 1A) and T lympho- 
cytes (Fig. IS) with high affinity (1.8 and 1.4 nM for monocytes 
and T cells, respectively). This level of binding to monocytes 
and T cells by 126 I-N33A was comparable to that of 136 I-. 
rhSDF-la as examined in parallel experiments (data not 
shown). Unlabeled N33A displaced 126 I-rhSDF-la binding to 
both monocytes and T lymphocytes (Fig. 1, C and D), and 
likewise, the binding of 125 I-N33A to these cell types was also 
displaced by both unlabeled N33A and rhSDF-lot (not shown). 
Consistent and considerable migration of monocytes and T cells 
was induced by N33A (Fig. 2). The potency and efficacy of N33A 
in the induction of mononuclear cell migration was comparable 
with rhSDF-lot, suggesting that chemically synthesized N33A 
retains the tertiary structure and functions as well as rhSDF-la. 

The effect of SDF-1 on neutrophils is controversial (1, 3, 4). 
While some investigators failed to detect chemotactic activity of 
SDF-1 for neutrophils (1), others were able to induce significant 
Ca 2+ mobilization in neutrophils at physiologically relevant 
concentrations of SDF-1 (4). In an effort to clarify the activity of 
SDF-1 on neutrophils, we tested the binding and function of 
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Fig. 2. Chemotactic response of 
monocytes and T cells to N33A and 
rhSDF-lor. Different concentrations of 
N33A or rhSDF-la were placed in the 
lower wells of the microchemotaxis cham- 
ber. The cells were placed in the upper 
wells, which were separated from the 
lower wells by polycarbonate filters. After 
incubation the filters were removed, 
stained, and the cells that migrated 
across the filters were counted. The re- 
sults are expressed as the CI representing 
the fold increase of cell migration in re- 
sponse to chemokines versus medium con- 
trol. CI ^ 2 are statistically significant in 
comparison with the spontaneous migra- 
tion (in response to control medium alone, 
CI =1). 
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Fig. 3. Effect of N33A and rh SDF-1 a 
on human peripheral blood neutro- 
phils. The binding of radiolabeled ligands 
to neutrophils and the chemotactic re- 
sponses were examined as described for 
monocytes and T cells. Panel A, binding of 
N33A to neutrophils; panel B, binding of 
rhSDF-la to neutrophils; panel C, dis- 
placement of 1Z5 I-N33A binding to neutro- 
phils by unlabeled ligands; panel D, neu- 
trophil migration in response to N33A 
and rhSDF-la. CI ^ 2 are statistically 
significant in comparison with the spon- 
taneous migration. 
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N33A on neutrophils in comparison rhSDF-la. As shown in 
Pig. 3, human peripheral blood neutrophils expressed a sub- 
stantial number of specific binding sites for both N33A (Pig. 
3A) and native SDF- la (Fig. 3£). The binding is of high affinity 
with estimated K d values of about 5 nM. N33A and rhSDF-la 
competed equally well for each other's binding as shown by the 
displacement curve (Pig. 3C and data not shown). Neutrophils 
also migrated in response to both N33A and rhSDF-la, indi- 
cating that neutrophils are indeed among the target cell types 
for SDF-1. 

To further confirm that N33A utilizes CXCR4 as its func- 
tional receptor, HEK293 cells stably expressing CXCR4 
(CXCR4/293 cells) were employed. Wild type HEK 293 cells 
exhibited a low level of specific binding (about 200 binding 
sites/cell) for both N33A and native SDF-la. Both N33A and 
native SDF-1 also induced a weak but significant directional 
migration of wild type HEK293 cells (CI - 2.1 ± 0.2, at 120 nM 
ligand concentration). This is in agreement with the notion 
that a great variety of nonhematopoietic cells express CXCR4 
mRNA. However, HEK293 cells overexpressing CXCR4 ex- 
pressed markedly increased number of specific binding sites for 
radiolabeled N33A and rhSDF-la (Fig. 4, A and B). N33A and 
rhSDF-la mutually competed for binding to CXCR4/293 cells 
(Fig. 4, C and D). Both N33A and rhSDF-la were able to induce 
a remarkable directional migration of CXCR4/293 cells (Fig. 5) 



with similar potency and efficacy. HEK293 cells transfected 
with known chemokine receptors, including CXCR1, CXCR2, 
and CCR1-5, did not show any increased binding or activation 
by N33A or rhSDF-la over background levels, whereas these 
cells specifically bound and migrated in response to their li- 
gands (data not shown). These results demonstrate that N33A, 
like rhSDF-la, uses CXCR4 as a functional receptor. 

Chemokines are important mediators that participate in a 
variety of pathophysiological conditions including inflamma- 
tion, infection, tissue injury and repair, immune responses, as 
well as malignancy (6, 7). Recently members of chemokine 
receptor family have been identified to be fusion co-factors for 
HTV-l entry into host cells. Some chemokines were able to 
inhibit HIV-1 entry through competitive occupancy of the rel- 
evant receptors such as CCR5 or CXCR4/Fusin/LESTR which 
mediate the fusion of either monocytotropic or T lymphocyto- 
tropic viruses, respectively, with CD4 + host cells (2, 3, 13, 
23-27). SDF-1 has recently been identified as the ligand for 
CXCR4 and was able to inhibit the cell fusion mediated by the 
envelope protein of the T lymphotropic HIV-1 strain (23—28). 

SDF-1 is a member of the CXC chemokine subfamily and was 
first identified as a molecule possessing pre-B cell-stimulatory 
activity (29, 30). It was later described as a chemoattractant for 
resting T lymphocytes and monocytes (1). As a CXC chemokine, 
SDF-1 has several unique features in comparison to other 
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Fig. 4. Binding of N33A and rhSDF-la to CXCR4/293 cells. CXCR4/293 cells were gently detached from plastic flasks with trypsin/EDTA, 
washed, and measured for their ability to bind N33A and rhSDF-la. Panel A, binding of N33A to CXCR4/293 cells; panel B, binding of rhSDF-la 
to CXCR4/293 cells; panel C, displacement of N33A binding to CXCR4/293 cells by unlabeled ligands; panel Z), displacement of rhSDF-la binding 
to CXCR4/293 cells by unlabeled N33A and rhSDF-la. 
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Fig. 5. Chemotactic response of CXCR4/293 cells induced by 
N33A and rhSDF-la. CXCR4/293 cell migration was measured by 300 
min incubation at 37 *C in microchemo taxis chambers as described 
previously (21). CI s: 2 are statistically significant in comparison with 
the spontaneous migration. 

members of the same family. SDF-1 mRNA is expressed con- 
stitutively in virtually every tissue including heart, liver, lung, 
brain, muscle, spleen, and kidney (28 -30). Expression of SDF-1 
gene is not affected by proinflammatory stimuli (31), in con- 
trast to most other chemokines which are mainly expressed in 
response to proinflammatory cytokines and are believed to 
regulate the recruitment and activation of mature leukocytes 
at inflammatory foci (6, 7). In the absence of inflammation, 
blood monocytes constantly replace mononuclear phagocytes in 
the tissue, sustaining a stable level by extravasation from blood 
stream and undergoing differentiation into macrophages. 
SDF-1 displays a tissue distribution that is considered appro- 
priate for function in lymphocyte recirculation, in basal recruit- 
ment of monocytes and in normal replenishment and turnover 
of tissue mononuclear phagocytes (1, 30-32). However, neutro- 



phils do not normally infiltrate organs or tissues even though 
these cells express binding sites for and migrate in response to 
SDF-la in vitro, as further demonstrated in the current study. 
Thus, the role for SDF-1 as a primordial chemokine regulating 
primarily the tissue distribution of leukocytes needs further 
investigation. 

SDF-1 also has several essential functions in development 
(2). Mice lacking SDF-1 due to homologous recombination died 
perinatally and although the numbers of B cell progenitors in 
mutant embryos were severely reduced in fetal liver and bone 
marrow, myeloid progenitors were reduced only in the bone 
marrow not in the fetal liver, indicating that SDF-1 is respon- 
sible for B cell lymphopoiesis and bone marrow myelopoiesis. In 
addition, mice deprived of SDF-1 gene had a cardiac ventricu- 
lar septal defect (2). 

The SDF-la analogue N33A was chemically synthesized and 
was shown in this study to be equally as potent as native 
SDF-la in binding and activating human leukocytes as well as 
CXCR4-transfected HEK293 cells. N33A has also been shown 
to induce human B lymphocyte migration and to effectively 
inhibit infection of PM-1 cells by HIV-l(LAV). 2 The preparation 
of analogues is greatly facilitated by molecular chemical syn- 
thesis, in which proteins can be produced either singly or by 
combinational methods. The analogues could include a full 
range of genetically encoded amino acids as well as unnatural 
backbone structures and unclonable residues such as D-amino 
acids, fluorescent or nuclear magnetic resonance- sensitive nu- 
clei. The activities of analogues can also be tuned by fast cycles 
of synthesis-design-assay-resynthesis (33). The validity of this 
approach is demonstrated by the fact that it yields functional 
molecules such as the SDF-la analogue N33A, a biological 



2 M. A. Siani, D. A. Thompson, L. E. Canne, G. M. Figliozzi, S. B. H. 
Kent, R. Simon, J. Cyster, P. E. Kennedy, E. D. Smith, and E. A. Berger, 
unpublished observation. 
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contamination-free agonist of SDF-1 that can be utilized in 
studies of the immune system and host defense against AIDS. 
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Chemical protein synthesis 

Jill Wilken and Stephen BH Kent 

Chemical protein synthesis is a field in transition. Previously, 
the synthetic accomplishment itself was the major focus of 
work m this field. Increasingly, chemical synthesis is now being 
applied to understanding how biological function originates in 
the structure of the protein molecule. A novel approach - 
'chemical ligation', which is the chemoselective reaction of 
unprotected peptide segments in water at pH7 - has made 
the total synthesis of proteins a robust and practical route to 
the study of structure-function relationships. For certain protein 
families, chemical protein synthesis is the most effective way to 
obtain functional proteins direct from genome sequence data. 
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Abbreviations 

Aba L-u*NH 2 -/T-butyric acid 

ESMS e'ectospray ionization mass spectrometry 

rec recombinant 

SPPS stepwise solid phase peptide synthesis 

Introduction 

Proteins are the Inner it mu I gene products, responsible for 
essentially all rhc Jctiv icics of the biological world. For this 
reason, an understanding of protein structure and function 
is an important objective nf modem biomedical research. 
In recent veais, proteins have been most commonly stud- 
ied by over-expression in microorganisms using recombi- 
nanl (rev) DNA-bascd technology [I]. Techniques such as 
Mte-direered mutagenesis \d) have allowed rhe systematic 
variation of protein structure and have vastly increased our 
understanding of the protein world. 

The preparation of protein molecules by chemical synthe- 
sis has Ion}:; been an objective of researchers interested in 
the moleeular origins of protein function |H The goal of 
total chemical synthesis of protein molecules provided an 
incentive to classical organic chemists over a period of 
decades and culminated in the synthesis of insulin and a 
series of disulfiile-mispaired (and hence deliberately mis- 
folded) insulin analogues |4l. More recently. Sakakibara 
|5] has persevered with the classic solution elicmistiv 
approach to the rotal synthesis of proteins, by means of 
convergent chemical condensation of fully protected pep- 
ride segments in organic solvents. This technique has 
scored a number of successes, such as the total synthesis of 
midkhu: |(>1 and related molecules, and has been used for 
.snub of srritcture-aerivit\ relationships 1 7]. 

\Icrri field's stated coal in developing rhc stepwise solid 
nhasc neotidc s\n thesis (SPPSI method was the rotal 



chemical synthesis of proteins [H|. In rhc SIM'S technique, 
protected amino acids in organic solvents are added one at 
a time to a resin-bound peptide chain, resulting in the 
assembly of the target peptide in fully -protected, resin- 
bound form. The prod iter polypeptide is released by 
deprotection and cleavage front the resin support. 
Perhaps the ulrirnatc accomplishment of (he stepwise 
SIM'S method was Schneider's total synthesis of crys- 
talline HIV-I prorease \% Subsequently, Wlodawcr e? oL 
[10 1 used this chemically synthesized protein to obtain 
the first correct crystal structure of this enzyme. 
Interestingly, Miller vt ai. [11.121 obtained the first co- 
crystal structures of the HIV-1 prorease with stibstiarc- 
de rived inhibitors bound in the enzyme molecule, using 
protein prepared by total chemical synthesis. The result- 
ing structural data were made freely available to the 
research community and formed the basis of the world- 
wide structure-based design programs that led to the 
development of rhc highly successful 'protease inhibitor* 
class of AIDS therapeutics 

Wells and co-workers |14] have used enzymatic ligation of 
unprotected peptide segineius for the total chemical syn- 
thesis of proteins, developing enzymes specifically engi- 
neered for this purpose. Thus, in a classic study. KNasc A 
and analogues were prepared using 'sulniligase* (an engi- 
neered form of subtilisin). In rhc current review period. 
Wong and co-workers have used eir/vmaric peptide 
re-ligdtion and cn/.ymatic glycosylation for the production 
of ribonuclease \\ glycoforms. 

The search for more effective routes ro the understand- 
ing of protein function has gained added urgency, 
because of the success of the expressed sequence tag 
genome projects. Huge numbers (>l()0JMI0) of 'proteins' 
have been discovered in the past two or three years, bui 
are at presenr frequently known only as open reading 
frames in a computer sequence database |ln|. This has 
led to a research bottleneck that was unanticipated as 
recently as three years ago; how can biomedical 
researchers obtain proteins, the 'functional gene prod- 
ucts', from database sequences in a timely and cost-effec- 
tive manner? 

Fortunately, a new technology has emerged that suggests 
that this unanticipated problem has an equally unlikclv 
answer: total chemical protein synrhesis! This review 
selectively focuses on rhis most promising of new prnrcin 
technologies, the synthesis of functional proteins directly 
from genome sequence data by 'chemical ligation', that is. 
the chemoselective reaction of unprotected peptide seg- 
ments in warer at pH7. 
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Functional proteins directly from genome 
sequence data 

As discussed abn\c. the HIV- 1 protease molecule was one 
of the first examples of the use of chemical protein syn- 
thesis ro uain direct access ro rhe functional protein, when 
all thar was know n was the putative polypeptide sequence 
encoded in genome data 1°-). 

More recent ly. ( !lark-l a: wis r/ at. |17] have \cry effectively 
used toral chemical protein synthesis by rhe srepwisc SPPS 
method (h'igure 1 ) to establish much of the basic knowledge 
in the chemokine protein family [171. Again, the approach 
has been to prepare the prorcins by total chemical synthesis 
based on DNA sequence data. In addition ro providing 
direct access to functional wild-type proteins, Clark-Lewis 
d at. \ M\ have also made unique analogues to study impor- 
tant aspects of chemokine biology. Notably, in order to study 
rhe functional role of protein dimer fommation rhey con- 
structed an 'obligate monomer' form of IL-S This 
chemical protein analogue had a methylated backbone 
amide bond that fire vented the formation of dimer ic protein 
in solution, and yet retained full biological activity [1 Al- 
chemical ligation 

'The most imporcunr recent development 1 in the total syn- 
thesis of proteins is the chemical ligation method |20|. The 
underlying concept of this novel approach is shown in 
bigiue 2. ( ihcioical ligation uses the principle of chemnse- 
lectivc reaction — that is. rhe reaction of two large (.vS-o() 
amino acid residue) unprotected peptide segments in 
water at neutral pH. to yield product polypeptides up to 
about 1-0 amino acids in length. An unnatural moiety 
icplaces the native peptide bond at the site of ligation. A 
chaorropic agent, such as 6 M guanidine.Hdl, is used to 
increase rhe concentration of rhe reacting peptide seg- 
ments ro i -S iviM. insuring rapid and complete reaction. 
Quantitative ligation is typically obtained in a matter of 
hoots, and the polypeptide product is simply isolated in 
one step by preparative re\crsc phase HI'Ld I'll. 

I se of unprotected peptide segments has a number of 
adv antages. Synthetic peptides up to -60 amino acids can 
be readily prepared by highlv optimized stepwise SPPS 
|22|. Unprotected peptides can be purified to a high degree 
of homogeneity by standard techniques, such as reverse 
phase I IPL(i or ion exchange chromatography. The entire 
process, including both chain assembly and purification, 
takes less than 2-1 hours. Furthermore, unprotected pep- 
tides can be characterized very effectively by techniques 
such as elect rosprav mass spectrometry w hich rely on ioniz- 
a hie (i.e. tin protected) functional groups for their effective- 
ness, f'inalh. unprotected peptides frequently have good 
solubility in aqueous solution: if necessary, their solubility 
t an be dramatically enhanced by the use of a chaotropie 
agent such as oM guanidine.H< !l [201. 

The chemical ligation approach appears to be generally 
applicable. It has been used to make a wide range of protein 
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Chemical protein synthesis by means of stepwise SPPS. Because of 
the accumulation ot resin-bound by products, the size of polypeptide 
that can be synthesized in this way is restricted. Stepwise SPPS is 
limited to the synthesis of small proteins. AA, amino acid residue. 



molecules, rouging op to 22 kl)a in size |23] (Table 1). 
Proteins prepared by total chemical synthesis in this man- 
ner hav e rhe expected biological activities, and are correct- 
ly folded. In several examples, techniques such as NMR 
[24) and X ray crystallography |25'*| have been used to 
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Figure 2 
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I Cuneri Ootnion .n BiOlechrology 

Principles of chemical ligation. Two completely unprotected peptide segments are covalently joined by means of the chemoseleclive reaction of 
unique, mutually reactive functional groups. In the original version of this approach [20], an unnatural (i.e. non-peplide bond) motety is formed at the 
ligation site. 



elucidate the rlucc dimensional structure of proteins pre- 
pared hv loud chemical synthesis, using the chemical liga- 
tiun method in irs original form. 



amiiuMerminals and no earhnxy-tcrminal! This fused bi- 
directional polypeptide folded efficiently ro give :i ptoiciu of 
>().(■> kl);t with the c\peered DfSA-binding activities |2o|. 



Another interesting example of the use of the chemical liga- 
tion method in its original form ro ehieidare novel aspects of 
rhe molecular basis of protein function, i.s the total synthesis 
of a cnvalcnt hctcrodiotcr made tip of the b/HI ,1 [/'/. regions 
of the transcription factors cMyc ami Max [in]. Cannc (1 nL 
used convergent chemical ligation of four large peptide 
segments to make a synthetic protein construct that, 
although a polypeptide chain with linear topology, had two 

Table 1 

Proteins prepared by chemical ligation 



Protom Size References 

No. amino acids 



b/HLH transcription factor 180 12G| 

HIV-1 protease 2 * 99 1201 

D-HIV-I protease 2*99 [21] 

Flap engineered HIV-1 protease 2 ■•: 99 [54] 

HIV-1 prolinase (tethornd aimer) 202 [23] 

Intfirgnn yb3a nonprotein 126 [89| 

Ftbroneotin 10F3 domam 94 [24| 

Cytochrome b TASP 1 22 [69'1 

Artificial proteins 195 |B7) 



Native chemical ligation 

In 1 ( J°4. Dawson ciai. introduced a remarkable refine* 
incut of the chemical ligation method in which two unpro- 
tected peptide .segments were reacted to directly yield :i 
polypeptide product with a native peptide bond ar the site 
of ligation. 'Native chemical ligation' as this technique is 
called, has proven to be a robust and practical method for 
the total synthesis of proteins. The principles of native 
chemical ligation are shown in Figure .v The method 
involves rhioesrer-mctliaied chcmnselccriv e reaction of 
two unprotected peptide segments in aqueous solution at 
neutral pi I, and results in ligation at a cysteine residue via 
a native amide bond. A typical example is the total syn- 
thesis of the enzyme human secretory phosphn lipase A 2 
(124 amino acids. \ A cysteine residues in seven disulfides) 
|2K*|. The native chemical ligation method has been the 
subject of mechanistic srudies |2°*T and Iv.is been 
rev iewed Kver increasing numbers of proteins ha\c 

been made u.sinu this rcehnU|uc (Table 2). Keen use ol' its 
speed and efficiency lot small, cysteine-rich proteins, 
native chemical ligation has become the method of choice 
for the rapid preparation of wild-type (i.e. nativ e) proteins 
direcrlv from ueuome sequence dura 
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Figure 3 
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Native chemtcni ligation. In a refined version of the chemical ligation approach, two unprotected peptides are reacted in aqueous solution at pH 7 
to yield a polypeptide producl with a native peptide bond at the ligation site [27]. This is accomplished by thiol exchange between the carboxy- 
torrnmal th'oestcr of one peptide and the Ihiol side chain of the cysteine residue at the amino-terminal of the other peptide. Rapid, irreversible 
intramolecular nucleophiltc attack yields the native peptide bond. 



The protein folding 'problem 5 

('.'he mica I synthesis of large polypeptide chains is a neces- 
sary step in the tora] synthesis of proteins, bur is nor in 
itself sufficient: the polypeptide product must be aoenrare- 
l\ folded co form the unique three-dimensional structure 
of i he protein molecule. It is this three-dimensional struc- 
ture that determines the activity of the piotcin. Several 
decades ut;o, Anfinsen [M\ hypothesized that the linear 
sequence of amino acids in a polypeptide chain was in 
itself sufficient to determine the precise three-dimensional 
structure of the protein molecule. Abundant evidence 
from /;/ vitro folding studies of isolared proteins has sup- 
ported this hypothesis. Indeed, because it is the linear 
amino acid sequence alone that is encoded by the ON A 
sequence of the gone, it must of necessity determine the 
functional form of the protein. 

More recently an elaborate cellular machinery associated 
with protein folding /'// vivo has been elucidated |.VH 



Although none of the research into this 'chapernne' appa- 
ratus has been interpreted as contradicting Anfinsen's 
fundamental assertion, it has nonetheless come to be 
widely believed that special cellular conditions arc 
required for accurate protein folding, especially of larger 
multi-domain proteins. Such skepticism has been rein- 
forced by the frequent observation that proteins that, are 
rceombinantly expressed at high levels in microbial cells 
form inclusion bodies [34) (i.e. intracellular precipitates); 
proteins recovered from these inclusion bodies are often 
difficult to fold efficiently ro give active protein mole- 
cules |35|. This mind-sec has caused considerable skepti- 
cism about the possibility of accurate /;/ vitro folding of 
chemically synthesized large polypeptides to form native, 
functional protein molecules. 

Concerns about the correct folding of synthetic polypep- 
tide chains can only be effectively answered by experi- 
ment. So far. in rhe chemical synthesis of more than. 200 
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Table 2 

Proteins prepared by native chemical ligation. 

OasG/tamily 



Size No. of References 

No. amino acids Cys [disulfides] 



Secretory 

Chemokines 68-90 

Ser PR inhibitors 58-70 

AGRPs 46- II 2 

Anaphlyatoxins 72 

fiGFs/TGF-a 50 

Receptor/membrane 

(i2 Microglobulin 99 

Intracellular 

SH2 domains 95 

SHH domains 60 

Transcription factor 77. 

Redox 

DosuHoradoxin 2 x 36 

Rubredoifin 53 

Cytochrome bb 82 

Enzymes 



4 [21 [31*.48l 
6 [None! (37,4 1,55*| 
1 0 [51 [D Thompson! 



[31 
[3] 



2 [11 



V (None] 

r 

6 [Nonel 



[R Simon] 
[M Siani] 



[J Wilkenl 

(T Muir] 
[J Wilken] 



4 iNonej [D Lowj 
4 [None] 

r [None] 1C Hunter] 



Retroviral proteases 2 : 


« (99-U6) 


2 


[Nonel 


fS Walker) 


Human seoPLA2 (G2) 


124 


14 


[7] 


[28-] 


Macrophage migration 


3x 115 


2 


[None] 


[L Canne] 


inhib. faclor 










Barnase 


110 


r 


[Nonel 


[29"] 



'No. native cysteine residues; a single cysteine introduced to form a 
ligation site. 



proteins from numerous families and falling into many dif- 
ferent classes (Tables 1 and 2), efficient folding of synthet- 
ie polypeptides to form functional protein molecules has 
been observed without exception. This is true even of 
polypeptides eon tain ins large numbers of cysteine 
residues, which must fold to form a precise partem of 
disulfide bonds, frequently regarded as a daunting task 
|.V)|. because, large umimnts O100 milligrams) of high puri- 
i\ pi >lv peptide are obtained in a single step from chemical 
ligation of unprotected peptide segments |37|, it is possi- 
ble ro use several one milligram samples to undertake a 
preliminary examination of :i range of typical folding con- 
ditions in a matter of only a few hours. Optimal conditions 
can ihcn In: used for folding of the bulk of the synthetic 
material. An example of the folding of a small, cysieinc- 
rich chemically synthesized protein using this approach is 
shown m I' igure 4. 

It mav be that the invariably efficient folding of chemical- 
ly synthesized polypeptides is simply a matter of starting 
with abundant amounts of material of high chemical puri- 
ty Alternatively, the ex peri men rally observed facile fold- 
ing of proteins prepared by total chemical synthesis may 
be due to the fact that the preparation of chemically syn- 
thesized polypeptides docs not proceed via an intermedi- 
ate 'inclusion hcH.lv' (i.e. aggregated) state. 



Analytical characterization 

Synthetic protein chemists of the late 20 th century have 
the advantage of powerful analytical techniques for 
determining the chemical structure of the protein mole- 
cules that they construct. 

The covalent structure of synthetic protein molecules can 
be rapidly and accurately determined by eleetrospray ion- 
ization mass spectrometry (KSMS) [3S]. Unprotected 
polypeptides, regardless of size, are efficiently analyzed 
in this way. In just a few minutes, the mass of a polypep- 
tide can be measured with an accuracy of better than 1 
part in 10.000. For the approximately fifty-residue unpro- 
tected peptide segments used in protein synthesis by 
chemical ligation, this means that the mass of each seg- 
ment can be determined to belter than ± 0.S Da. Such 
accurate mass measurement, together with the knowl- 
edge of the amino acid sequence and the synthetic pro- 
cedures used, is powerful confirmation of the correct 
chemical structure of the peptide segments used as 
building blocks in protein synthesis by chemical ligation. 
KSMS also yields useful data concerning purity of 
polypeptides 13°]. This KSMS data complements and 
extends that obtained by more traditional analytical tech- 
niques, such as reverse phase HPL(* [401. Ligated 
polypeptide products are also characterized by KSMS 
with the same precision as the peptide segments 
[28V29*WU7]. Furthermore, for synthetic functional 
domains of typical size, the mass accuracy is sufficient to 
determine that the correct mtmber of disulfide bonds has 
formed in the folding. For example, the folding of the o7 
residue ligated' synthetic polypeptide (measured mass 
7791.6 Da) to form the chemokine SDK- la (measured 
mass 7787,2 Da) resulted in the loss of 4,4 Da, precisely 
corresponding to formation of two disulfide bonds |3t*|. 
In another example of the use of mass spectrometry to 
evaluate correct protein folding. KSMS was used to char- 
acterize the insertion of metals into a zinc-finger protein 
domain (Figure 5). 

The three-dimensional structure of the folded synthetic 
protein can also be rapidly evaluated using modern mul- 
tidimensional NMR techniques. The standard research 
scale of chemical protein synrhesis gives multiple tens- 
of- milligrams of high purity synthetic protein directly 
from sequence data. Such amounts are ideal for NMR 
studies. Simple multi-dimensional 'fingerprint* measure- 
ments will quickly tell whether a synthetic protein has a 
unique folded form |4I|, or consists of a mixture of forms 
|42|. An example of such a measurement on the protein 
AOI'-RANTKS. prepared bv total chemical synthesis, is 
shown in Figure o. More complete NMR measmcments 
on a synthetic protein can be used to experimentally 
determine the complete three dimensional structure of 
novel molecules 143]. 

X-ray crystallography can also be used to determine the 
three-dimensional structures of novel proteins prepared In 
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Figure 4 
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Efficient folding ot a chemically synthesized protein. "Hie uysteine-nch carboxy-terminal domain of Ihe Agouti-related protein, termed 'minimized 
Agouti-related protein* (MARP). has 46 amino acids and includes ten cysteine residues. Folding of such small cysteme-nch polypeptides is 
regarded as difficult 1361. As can be seen here, the synthetic polypeptide containing a she-specific biotin label, folds efficiently to give a good yield 
of high purity labeled piotem (ot lower left; 'Ac-MARP[Lys 1M:s (N F Biotin)'). Multidimensional NMR measurements show that this synthetic protein has 
a single defined three-dimensional structure, and bioassays show that the protein retains all the activities of the Agouti-related protein 
(DA Thompson, personal communication). 



tniiil rlieunral synthesis. A number of proteins prepared by 
r«tr;i1 eheinienl s\ nt lit; sis hnse.il mi ueimme sequence dat;i 

have been crystallized and shown to d if tract X-rays to hij^h 
lesoliitinn 1 10,1 1 .2W44], The crystal struct tire of a chem 
iealK synrhesi/.cil protein is shown in Figure 7. 4*1 te 



incorporation of [4Se)Met into a protein by chemical syn- 
thesis, together with use of a synchrotron radiation source 
for the diffraction studies, provides phasing information 
that can be used to solve a structure without making mul- 
tiple isomorphous heavy atom crystalline forms. Initiatives 
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Figure 5 
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insertion of metals into the chemically synthesized ztnc finger protein DSX, the DNA-binding domain of the sex-determining gene product in 
Dtosophitu, observed by electrospray mass spectrometry, (a) Unfolded DSX synthetic 72 amino acid polypeptide: observed molecular mass 

8355.2 ± 0.25 Da (calculated 8353.8 Da. average isotope composition), (b) Folded DSX (Zn2i) 2 protein: observed molecular mass 

8480.3 ± 0.4 Da. The mass difference between the unfolded synthetic 72 residum polypeplide chain and the folded, melal-contairvng protein was 
25.1 £0.7 Da in good agreement with tour ionized cysteine side chains binding two Zn ions (i.e. 2 x 65.3 Da - 4 x 1 Da = 126.6 Da). Note the 

change in charge state distribution characteristic of proteins ioni7ed from the native, folded state. 



j re under way to use X-ray crystallography anil chemical 
protein synthesis to obtain experimental structures for 
entire families of molecules. 

Functional characterization 

1 1 i-i no ins prepared by tor a I chemical synthesis arc rourinely 
assaved lor biological and biochemical acib ity Afrcr fold- 
ing, purified symbolic proteins have been shown ro have 
activities imlisrin^itislvablc from reference materials, typi- 
cally proreins purified from reel) \A-c\pression systems 
\.\\ *|.. More and nunc frequently, however, the 'cloned' pro 
K'.in 'in no) available in a vimelv fashion and the chemically 



synthesized protein becomes the reference material for lhar 
field of research 14.S.46|. 

A good example of this is the chemokinc family of pmtcins. 
The chemokines (the name is a contraction of '///rwnac- 
tunt cyt^>/7//^.v , ) are a family of small (-70 amino acid) pro- 
teins that have a conserved pattern of cysteine residues 
(usually four). They are involved in all aspects of human 
inflammatoiv and immune responses and ha\e other activ- 
ities, such as hematopoietic effects on bone marrow cells. 
Recently, chemokinc receptors have been shown ro be the 
essential co-receptors lor 1 1 1 \\ and the chemokines 
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Figure 6 
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'H NMR measurements on the synthetic: protein AOP-RANTES. Data , 
shown is a 2D TOCSY measurement performed at 500 MHz. Hie data 
Heliiius a smyle defined 3D structure, consistent wilh the canonical 
chemokino Itiid. Full assignment and interpretation of a complete set 
on NMR measurements is under way (PN Barlow, H McSparron, 
DA Thompson and J Wilken, personal communication). 



themselves can block the eulrv of HIV- 1 into human cells. 
K01 these reasons, chemokines arc of j^rcai interest to the 
biomedical research communiry H7J. furthermore, 
because of their small si/.e and disci net i\c partem of cys- 
teine residues, chemokincs have been identified in increas- 
ing numbers in the expressed sentience ui.u nucleic acid 
M'c|ncnce databases. Moil- than 4t) human cheinokine pro- 
teins have now been identified, together with a j^rowintf 
number of viral chemukincs active against human receptors. 

(Ihcuiical protein wnrhesis has been used to elucidate 
much of the original knowledge of the biology of the 
chemokincs ( 1 7|. A case in point is the chemokine SDK-lu, 
1 he n a rural liearni for the CXCR4 receptor. CXCR4. for 
mcrh known as- I .esie 1 *>i Kusiu. is the corcccptoi Inr lllY-1 
eni r\ inro human T i\:\\s. S| )K - 1 rx was ident ificd as t he nat- 
ural liuaod for CXCK4 usint: chemokine prepared by coral 
chemical swuhesis M7.4;y4iSh most follow up srudies on 




Three-dimensional structure of the chemokine [N3,3A]SDF-tu 
determined by X-ray diffraction studies performed on crystalline 
chemically synthesized protein [44], The polypeptide backbone is 
shown, together with the space filling van der Waais contact surface of 
the protein molecule. 



SDK- lot have also been undertaken usin.n material pre- 
pared by chemical protein svnrhesis |4«S|. When material 
from recDNA-based expression in i%. soft finally became 
available, ir was shown to haw activity indistinguishable 
from the reference synthetic SDK-la prepared by native 
chemical ligation (Figure 8) |31*|, 

Multidomain proteins 

Based on abundant experimental evidence, there can be 
no doubt that synthetic polypeptide chains corresponding 
to individual protein domains fold efficiently and correctly. 
These domains are the basic building blocks of the protein 
world. The demonstration that a wide range of fully func- 
tional domains can be efficiently formed from chemically 
synthesized polypeptide chains constitutes strong evi- 
dence that the world of protein function is generally acces- 
sible 10 the synthetic chemistry approach. Admittedly, thus 
far only a limited number of multidomuin proteins have 
been prepared by total chemical svnrhesis (see Tables 1 
and 2). Nonetheless, we can be ahsolurcK eon fid en r of 
general access ro large multidomain proteins because, if 
necessary, the individual domains can first be folded, then 
stitched together as folded protein molecules us inn estab- 
lished chemical techniques, such as nxime-formint» liga- 
rion. first pioneered with synthetic manipulations of 
recombinant proteins [4 l i|. 

Applications of chemical protein synthesis 

Total synthesis prov ides for the 'straight forward application 
of the rools of chemistry 10 the world of proteins. This has 
a number of immediate and useful consequences. 
\li>lc<:nlai const mets that are highh desirable but that 
previously have been virtually impossible ro make, even 
usinc elaborate novel rccDNA-based /// vifro expression 
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Figure 8 
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B'Oiogical assay data for IN33AISDF-1 rx versus recHumanSDFl -a. Inhibition ot CXCR4-mediated HIV- 1 -induced coll fusion by increasing 
eoneunimlions of the chc-mokine SDF- la. (a) Recombinant human protein. Cb) Chemically synthesized fN33A]SDM<x. Two different HIV 
unvKlopes weie used: Ba-L usws the chemokine receptor CCR5 as an entry co-factor; LAV uses CXCR4. The natural ligand for CXCR4 is the 
chemukine SDF 1 u, thus increasing concentrations of SDF1 -a inhibit LAV-mediated fusion, but have no effect on Ba-L mediated fusion. {M Siani, 
E Berycr, personal communication). 



technologies |5()|. become accessible in a direct unci practi- 
cal fashion by chemical protein synrhesis. 

Non-coded amino acids 

Nnn coded amino acids can be readily incorporated into 
proteins bv total chemical synthesis. An early example of 
the power of this appioach was the |Aha'* 7 tJ5U ' ?1,>s ]H IV- 1 
pi oi case molecule prepared by total chemical synthesis [ 9] 
;tn(! used feu the original X-ray crystallography of this 
important target for drug design 1 1(1.11] (sec above). The 
I'.mr cysteine residues in the enzyme molecule were 
replaced by L -cx-M ■ l r //-hutyric acid (Aba) residues, efTec- 
civcly tcplaeiiigihe side chain -SH moiery uf each cysteine 
residue with the isostcric -CH-, group. By removing 
unpaired cysteine Milfhydryl groups, this .substitution 
u.ieatlv improved rhe handling properties of the protein 
uithoiir alTecting its structure and en/.ymaric pi open ic* 
| ( )- 1 i |. Main subsequent examples of the incorporation of 
non-coded ammo aeids b\ total chemical protein synthesis 
have been reported, some involving more elaborate ammo 
acid replacements. Mich as the incorporation of a geometri- 
cal constrained fl-inrn dipeptide mimic into a functional 
pioiein |SI|. Similarly constrained helical peptidomimer- 
ie.s have been reported and eon Id also be used as 
building blocks for the chemical const met ion of proteins. 

Protein structure-activity relationships 

Chemical protein synthesis lias been used to carry out 
iniiiim: studies elucidating the structural origins of protein 



function, For cvample. Love vttti. |5.>'|. prepared a novel 
form of the protein ubiquitin. in which two amino aeids in 
the hydrophobic core were replaced by non-coded 
residues; Val2n->Aba; and Ue.W^Nva ( I Valine) (Aba 
being straight chain aliphatic [-CfTCHj and N\a being 
straight chain aliphatic [-(IHiCIKCHJ). In each case, 
rhese changes amount to the deletion of a .single methyl 
group from the native sidechain. The synrhetie protein 
variant was crystallized ;tnd X-ray diffraction showed an 
overall contraction of the molecule, and the introduction of 
a small internal cavity, which decreased the stability of rhe 
protein. The modified protein showed unaltered biological 
activities 

Protein backbone engineering 

The systematic modification of the chemical nature of rhe 
polypeptide backbone structure, ami the correlation of 
such changes with effects on protein function, is also made 
uniquely possible by chemical protein synthesis. Kaea and 
Kent [54] used deletion of H-hund donor moieties (by con- 
version of-CONH- to -COS-) in IIIV-1 protease prepared 
by ehemieal ligation to show that protein 'flap' backbone- 
to-substrate hydrogen bonds, mediated by a lerrahcdialb 
coordinated water molecule, were essential to full catabtic 
activity of this enzyme. More re cent! v. Uaca and co- 
workers [2V*| extended and refined tins work b\ using 
convergent chemical ligation to constrmr a 22 kl)a cova- 
lent tethered dimer form of the HIY-I protease in 
which the backbone H-bondim* clement of onl\ a single 
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fkip was deleted. This backbone -engineered HIV-1 pro- 
tease retained full intrinsic catalytic activity 1 25**]. Such 
•en lilies suggested thai the Hl\-1 protease enzyme mnle- 
ctile. a homodimcr of identical W residue peptide chains 
constituting a single acrivc: sire, uses only a single flap in 
catalysis, in a manner analogous to the single polypeptide 
chain, two domain cell-encoded aspartvl protease cn/ymcs 
|2V'|. This finding lias significant implications for the cor- 
rect understanding of the mechanism of the HIV-1 pro- 
tease, and for the design of improved prorcase inliihiror 
A 1 1 )S therapeutics. 

Intermotecular hydrogen bonding 

backbone to -backbone amide hydrogen bonds arc fre- 
quently observed in pmtcin complexes. 'I 'he contribution 
of inienooleeitlar backbone hydrogen bonding to 
prntein-pnuein i me ructions was experimentally quanti- 
fied by specific deletion of a single 1 bbonding element 
(i.e. conversion of a backbone -CONH- to -COO-) in syn- 
thetic turkey ovomucoid third domain, a small serine pro- 
tease inhibitor protein Kvaluarion of the binding of 
I his backbone ester-containing protein inhibitor molecule 
it* a panel of serine proteinases .showed that this hydrogen 
bond mimibiired 1 ..S ± 0.3 kcal/niolc to the formation of 
the en/A me- ligand complex. Such studies are made 
uniquely possible b\ chemical protein synthesis. 

Mirror image D*proteins 

Total chemical synthesis can be used to prepare polypeptide 
chains made up entirely of D-amino acids, rather than the 
I. amino acid polypeptide chains found in nature as rhe 
products of ribosomal synthesis. Such D-amino acid 
pol\ peptides fold with high elTiciency to form proteins of 
defined rhree-dimcnsional s truer ore; the resulting protein 
molecules are rhe mirror image of the native I, -proteins, and 
display reciprocal cbiral properties (i.e. circular dicliroism 
specua: mirror image folds by \MR and b> X-ray eiystal- 
Ingraphv; minor image substrate chiral specificities in 
cm/a rnes). An early example of the total synthesis of mirror 
invade proteins was the D- and L -forms of the tr>*psin 
inhibitor microprorein KKTI II |5o|. "J'he protein enan- 
liomcrs I)- and L-rubrcdoxin prepared by stepwise SPPS 
|.S7|. and I )- and I,-IIIY-1 protease prepared by chemical 
ligation [*?!]. were both studied by X-ray crystallography. 
The three-dimensional structure of ligated D-I1IV-1 pro- 
least: and rhe mechanistic significance of this work and 
other chemical variants of this important enzyme are dis- 
cussed by Miller // ///. |2V|. More recently. Schumacher 
/•/ .it. o^etl mirror image SH.i domains, prepared by 

cbeniK.al synthesis, to identify novel brands in phage pep- 
tide libraries h\ mirror-image phape display'. There is also 
:i :ecenr pivtiiiiiiurv report of cry sialography < in the mtitoi 
imai'c forms of (he sweet protein monellin |.S u f Mirror 
image prorcins can be used to obtain crystal structures of 
greater accuracy |57|, and to discover novel lead compounds 
horn thiral libraiics |5N|. Although mirror image proteins 
could be expeeted to have improved properties in vho (e.g. 
resistance to proteases, little or no imnuimigenicity\ only 



I )-proreins that work on achira! ligands (e.g. superoxide dis- 
mutasc) will retain their biological -activity. 

Glycoproteins 

The chemoselcctive reaction principle, in which tinptotcct- 
cd macromolecular building blocks arc set up to react with 
one another to yield defined ligation products of unique 
struct u re, also can be applied to the total synthesis of glyco- 
proteins. Zhao.fV ✓//. [<)()] linked complex carbohydrates to 
synthetic peptides by reaction of the aldehyde or ketone 
moiety at the reducing end of the carbohydrate with an 
NH.OCHv- moiety synthetically introduced on the pep- 
tide, to give the corresponding homogeneous oximc-linked 
glvcopcptidc of de lined molecular composition. Berto/.xi 
and co-workers used chemoselective ligation with- 

in the carbohydrate structure to elaborate synthetic gly- 
copeptides, and have applied this approach to elegant 
glycoprotein synthesis. Chemoselcctive synthesis of glyco- 
proteins of defined homogeneous molecular structure would 
have profound biotechnology applications in the field of 
human biopharmacenricals. The current srnre of chemical 
approaches to glycohiology has been review ed [<\\]. 

Circular proteins 

Topologically circular (i.e. cyclic) polypeptides that fold to 
form proteins are a novel class of molecules uniquely 
accessible by chemistry. Mtiii and co-workers [ 04 * * | 
reported the synthesis of a topologically circular polypep- 
tide chain corresponding to the WW domain of the intra- 
cellular signaling protein Yes kinase-associatcd protein 
(YAP). The 4b residue polypeptide chain was converted to 
cyclic form (i.e. a topologically circular polypeptide, with 
no amino-terminus, and no carboxy-terminus) using the 
native chemical ligation method |27| as applied to cyclic 
peptides |bS, 6b]. Interestingly, rhe circular polypeptide 
folded efficiently to form a protein molecule w irh activity 
and three-dimensional structure indistinguishable from 
rhe native (linear polypeptide) protein. Circular proteins 
could have great potential application in biotechnology; 
they would be expected to display increased thermal sta- 
bility and co resist the action of exoproteases. 
Fundamental insights into protein folding will be 
obtained by the study of these protein molecules formed 
from polypeptides with circular topology. 

De novo design 

Chemical synthesis can be used to construct artificial pro- 
rcins hased on <iv novo molecular design. The principles 
underlying this chemoselcctive reaction approach were 
elegantly demons 1 i a re t I by Rose in 1W4 |07|. More reccnt- 
K. a number of examples have been reported of rhe total 
synthesis of TASP (Template Assisted Synthetic Protein) 
molecules, using the modular chemical ligation approach 
first applied to TASPs by Dawson loH| and Rose [o7]. 

These include rhe construction of a 14.3 kDa heme- 
eontaiuing TASP [b ( > # J. Practical considerations governing 

TASP design and construction by chemical ligation have 
also been investigated [70|. 
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OcCrado !7J| icccntlv summarized the current state of 
the :in in tlu- ///• i.'ui'f design of protein molecules. 
Significant progress h ■ i s been made on >c\cral fronts, 
including Kc^an and co-workers' \H\ piorem st i ti ti u r:i I 
redesign (systematic changes of amino acid sequence 
causing a protein to form a tlifTcrcnr predetermined 
fold). and rlu: use of design principles ro equip proteins 
with new functions 1 7.^ |. True r/r novo design of novel 
polypeptide se<;uenees encoding a predetei mined pro- 
tein fold Iv.is :dso advanced considerably, ;md Dahiyat 
:ind Ma\o |74'| have rested rhe success of their novel 
predictive algorithms h> chemical synthesis and struc- 
tural characterization of the designed molecules. 
Chemical protein synthesis can plav a pivotal role in the 
practical evaluation of protein design because if provides 
;i diicct m ml c: to the protein moleenk: without the inter- 
vention of the eMnmcoos and potentially complicating 
factors found m h\ ing systems 

Conclusions and future perspectives 

The literature in rhe past year or so is characterized by a 
gi owing nuinbci of esainplcs of I he sltidy of the proper- 
ties of ;i \ aricty of small proteins produced by total chem- 
ical synthesis |75-NI)|. This is evidence that the field of 
chemical protein synthesis is moving from the pioneering 
' feasibility** stage to the use of chemically .synthesized 
proteins for serious study of protein function. Total syn- 
thesis m s i c i Li the chemical ligation of unprotected peptide 
sei'jiu-nis in aqueous soluiion has already provided robust 
and practical synthetic access to proteins in the 
/.()()(> -22.00(1 Da size range. Improvements (see below) 
promise to dramatically expand the type and size of mol- 
ecules w hich can be studied by ihc application of chem- 
rsti \ to the u orld of ptoirius. 

NovdI methodologies 

Solid phase chemical protein synthesis 

Ciniic vt ttl. |S| | have provided a preliminary report that 
describes the extension of the chemical ligation princi- 
ple to pol\ mcr-siippnittid synthesis. 'Solid phase chemi- 
cal ligation', the chemical ligation of unprotected 
peptides on a polymer support, rakes advantage of rapid 
purification by filtration of intermediate polymer-bound 
hi'.arion products and. thus, makes feasible the scqucii- 
r i ; 1 1 ligation of tip to eight peptide segments. The piiiici- 
pU:s nf ilu: approach are show n in l-'ignrc 0. Completely 
unprotected peptide segments were condensed in aque- 
ous solution on a watcr-compat ible solid phase. The 
method was il In sr rated with the synthesis of the 115 
amino acid polypeptide of macrophage migration 
inhibitory factor b\ setpiential ligation of four unprotect- 
ed peptide segments |M |. Kvcn at this early stage of its 
de\ rlopinciH, solid phase chemical ligation promises t.o 
provide facile and robust synthetic access to pulypcp- 
tides in the 120-200 amino acid size range. This corie- 
in rhe si/e range (it* protein 'domains', rhe 
;: no moi nous units of folding and function rhar are the 
building blocks of rhe piotein woild |S2|. 



Expressed protein ligation 

Muir and co-workers |K.VVS-I| have reported a ret ON. V 
bascd technology that, together with the native chemical 
ligation method in its original Ibim |27|, promises to fully 
inregrare the worlds of reel .)NA-bascd piotein expression 
and chemical peptide synthesis, ft has been possible since 
the first publication of the native chemical ligation method 
1 27 1 to covalently join synthetic peptides ro proteins 
expressed with an umino-terminal cysteine residue. The 
newer 'expressed protein ligation' method accomplishes 
the inverse, allowing the thiocster-mediated native chemi- 
cal ligation of expressed polypeptides to synihciic peptides 
containing an amino-terminal cysteine residue |X.> , \N4|. It 
takes advantage of an ingenious defective intein ex press it in 
vector that generates a transient thioester in a reenmbi- 
nantly expressed protein. In this way. synthetic peptides 
containing amiuo-termiual cysteine residues can be limited 
with reasonable efficiency to the carboxy-terminal of pro- 
reins produced by standard recDNA-exprcssion in microor- 
ganisms. Mechanistic studies indicate that expressed 
protein ligation proceeds according to the thioestcr-mediat- 
ed native chemical ligation mechanism |W|. Together 
with the Ycrdinc laboratory's application of the original 
native chemical ligarinn method |85], expressed protein lig- 
ation makes possible the introduction of precise chemical 
modifications to a range of clone d/cx pressed proteins. 

Novel applications 

Protein signature analysis 

The principles of combinatorial chemisrry have also been 
applied to protein molecules (861. I he protein signature 
analysis method involves rhe total synthesis of self-encod- 
ed libraries of protein analogs I «S 7 * J followed by a selection 
step, such as affinity chromatography, to separate function- 
al analogues from the pool of non-functional molecules. 
The [tools are decoded in one step using a chemical self- 
encoding .scheme applied ro proteins [No.K7*|. In this way, 
a single experiment generates patterns of information that 
describe the structural origins of function in the prorein 
molecule. As chemical access ro the world of proteins 
grows more robust, it can be expected that there will be 
further examples of rhe application of the principles ot 
combinatorial chemistry ro the world of proteins. 

Precision labeling 

In practical terms, one of the most useful future applica- 
tions of rotal chemical synthesis will be the site-specific 
labeling of proteins w ilh reporter entities for physical mea- 
surements and for precision assays. Chemical protein syn- 
thesis allows proteins to be precisely I a be let I at 
predetermined sites with complete \crsatitiiv 1 1 s i 1 1 y 
chemistry, it is as straightforward to introduce any of a win- 
ery of labels at one or more specific sites in a synthetic pro- 
tein as it is ro introduce a standard amino ;teid. Kxamptcs 
include rhe site-specific incorporation of biotin (Figure A). 
and the synthesis of a variety (, f fluorcNcemlv -labeled pro- 
teins, each consisting of a single molecular species. Such 
piccKion-lahelcd proteins, not readiK obtainable using 
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PnnciplfiK of solid phase chemical ligation. The oarboxy-terminal unprotected peptide segment is attached by a deavabte linker moiety to a water- 
eompatihh; ploymer support. Sidecham-unprotcctfid peptide"thioesl«r segments are sequentially reacted by native chemical ligation, to yield the 
lull Innqth polymer bound polypeptide. The polypeptide product is released from the support, purified and folded to give the target protein. 



iecl)\.\ uvhnii|iu:s, :ircr invaluable reagents fur a variety 
.it' biological studies and for assay development. 

Prufetn families 

Anoihcr useful ;i ppliiarion nf chemical protein synthesis, is 
die rapid preparation of families of proteins directly from 
sequence d;ita. ( 'hcoiical incorpnra-l ion of NMU probe 



nuclei or modified amino acids with suitable X-ray scatter- 
ing properties, such as |ISc]Mer. enables the rapid eluci- 
dation of protein structures. 

Membrane proteins 

Finally, the application of chemical protein synthesis to 
provide direct access to integral membrane proteins is an 
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ini nailing possibility. Membrane proteins constitute 
iD-MY/r of known proteins across ti variety of genomes, 
bused on ;in:i lysis of open rending frames [88]. 'I 'hey func- 
tion, ;imonj» other things, ns ion channels, transporter sys- 
tems, and receptor molecules. Very little is known -about 
the sin ten ir al basis of function for this important class of 
pmieins because, to date, inregrul membrane proteins hav e 
resisted attempts by rccDNA-basccj expression ro produce 
the lar^t: amounts i"if high [unity protein required for hi^h 
resolution sirnciure u'etci miiuuion. It may be that total pro- 
rein synthesis by chemical ligation will provide alternative 
strategies for the production of large amounts (tens-to-hun- 
drcds of milligrams) of the integral membrane proteins. 
This would have a profound effect on biological research. 
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■ Abstract In just a few short years, the chemical ligation of unprotected peptide 
segments in aqueous solution has established itself as the most practical method for the 
total synthesis of native_proteins. A wide range of proteins has been prepared. These 
synthetic molecules have led to the elucidation of gene function, to the discovery of 
novel biology, and to the determination of new three-dimensional protein structures by 
both NMR and X-ray crystallography. The facile access to novel analogs provided by 
chemical protein synthesis has led to original insights into the molecular basis of protein 
function in a number of systems. Chemical protein synthesis has also enabled the 
systematic development of proteins with enhanced potency and specificity as candidate 
therapeutic agents, 
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INTRODUCTION: Protein Science in the Postgenome Era 

An important current objective in biomedical research is to understand the molecu- 
lar basis of the numerous and intricate biological activities of proteins and therefore 
to be able to predict and control these activities. The importance of this goal is 
dramatically increased today because of the explosive success of the genome- 
sequencing projects, which have revealed hundreds of thousands of new proteins, 
but only as predicted sequence data (1). For the biologist, elucidation of the bi- 
ological function of a predicted protein molecule is thus a challenge of great 
significance. In the words of Freeman Dyson, "[In the post-genome era], proteins 
will emerge as the big problem and the big opportunity. When this revolution oc- 
curs, it will have a more profound effect than the Human Genome Project on the 
future of science aiid medicine" (2). 

For the past 20 years, most studies of the molecular basis of protein action have 
been carried out by recombinant DNA-based expression of proteins in genetically 
engineered cells (3). From its introduction, this powerful method revolutionized 
the study of proteins by enabling the production of large amounts of proteins of 
defined molecular composition and by allowing the systematic variation of the 
amino acid sequence of proteins (4). Expression of proteins in engineered cells 
is now a mature technology, and its scope and limitations are well understood: 
(a) Small proteins (i.e. <30 kDa) are easier to express than large, multidomain 
proteins; (b) folding of large-protein molecules can also be a challenge; (c) product 
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heterogeneity is frequently a problem, caused by uncontrolled processing of the 
nascent polypeptide in the cell; and (d) the overexpression of proteins that are toxic 
to the cell, such as proteases, can be difficult (5). 

Additionally, because the cell is used as a protein factory, such molecular bi- 
ology studies are inherently limited to the 20 genetically encoded amino acids. 
Efforts have been made to use cell-free synthesis to expand the repertoire of ri- 
bosomal synthesis to include noncoded amino acids as building blocks (6, 7), 
These attempts to incorporate other amino acids have had very limited success — 
obtaining adequate amounts of pure protein from the cell-free translation systems 
can be a significant challenge (8), and many unnatural amino acids are simply not 
compatible with ribosomal polypeptide synthesis (9). 

Chemical synthesis is an attractive alternative to biological methods of protein 
productioa The use of synthetic chemistry promises the unlimited variation of 
the covalent structure of a polypeptide chain with the objective of understanding 
the molecular basis of protein function. Chemistry also promises the ability to 
systematically tune the properties of a protein molecule in a completely general 
fashioa 

This vision was one of the prime imperatives of organic chemistry in the time of 
Emil Fischer at the beginning of the 20th century. In a 1905 letter to Adolf Baeyer, 
Fischer wrote, "My entire yearning is directed toward the first synthetic enzyme. 
If its preparation falls into my lap with the synthesis of a natural protein material, 
I will consider my mission fulfilled" (10). In the decades since then, the challenge 
of applying the methods of chemistry to the study of protein action has stimulated 
numerous advances in synthetic methods. Historically, these advances included 
the use of novel reversible protecting groups (11), novel activation methods for the 
formation of covalent bonds (12), and even polymer-supported synthesis (13), all 
of which sprang from the drive to apply the science of chemistry to the study of 
proteins. 

DOMAINS: Building Blocks of the Protein World 

Because proteins are large molecules, applying chemical synthesis to them is a 
considerable challenge. Furthermore, the biological functions of proteins orig- 
inate in the tertiary structure of the protein molecule — that is, in the precise 
three-dimensional folded structure of the polypeptide chaia The typical protein 
molecule is ~30 kDa in size and consists of two ~15-kDa domains (14-16); 
each domain lias a polypeptide chain length of ~ 130 (±40) amino acids (14-16). 
Protein domains are defined as autonomous units of folding and, frequently, of 
function (17, 18). As such, domains are the building blocks of the protein world. 
The challenge confronting the chemist is, first, the total synthesis of folded do- 
mains and then the ability to stitch these domains together to build complex protein 
molecules. 
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CHEMICAL PROTEIN SYNTHESIS: The State 
of the Art in 1990 



Since last reviewed in this journal (19), total chemical synthesis of native pro- 
teins has made a number of important contributions to biomedical research. It is 
notable that the Kent laboratory at the California Institute of Technology used total 
chemical synthesis based on predicted gene sequence data to cany out pioneering 
studies of human immunodeficiency virus 1 (HIV-1) protease enzyme (20). The 
existence of this virally encoded aspartyl proteinase had been postulated based on 
an analysis of viral nucleic acid sequence data, and molecular genetic studies had 
« § indicated that its action in processing the gag-pol polyprotein was essential to the 

| I viral life cycle (2 1). For this reason, the HIV-1 protease was, early on, proposed as 

1 1 an important target for drug development The first preparations of the enzyme of 

| g defined molecular composition were produced by chemical synthesis (22), using a 

« ^ highly optimizedversionofstepwisesohd-phasepeptidesyntliesis(19). Thiswork 

| £ proved that the active form of the HIV-1 protease was a homodimer consisting of 

two identical 99-residue polypeptide chains, and it showed that the chemically 
| synthesized enzyme accurately processed the putative cleavage sites in the viral 

gag-pol translation product (22). 
| % In a strikingly important contribution, total chemical synthesis was also used to 

f | prepare large amounts of homogeneous enzyme for the determination of the orig- 

| & inal crystal structures of the HIV-1 protease molecule (Figure 1). The structure of 

eg I the unliganded synthetic enzyme (23) corrected a seriously flawed low-resolution 

£ s structure (24) that had been obtained by using protein derived from recombinant 

X & expression in Escherichia coli. Even more significantly, use of chemically synthe- 

S g sized enzyme provided the first high-resolution cocrystal structures of the HIV-1 

§ g protease molecule complexed with substrate-derived inhibitors (25-27). These 

g p ft structural data were made freely available to the research community and formed 

•| \ the foundation for the successful worldwide programs of structure-based drug de- 

cs 3 sign (28) that led to the development of the highly effective protease inhibitor class 

£ & of acquired immune deficiency syndrome therapeutic agents (29). 



o 

T1 O 



SYNTHETIC-PEPTIDE CHEMISTRY: Useful but Bounded 



Despite successful syntheses of the HIV-1 protease (22) and of a limited number 
of other proteins (30-35), at the start of the decade of the 1990s total chem- 
ical synthesis, by the standard methods of peptide chemistry of even a small 
protein molecule remained a daunting task, often requiring large teams and tak- 
ing years to complete, with no guarantee of success. The routine, reproducible 
preparation of synthetic polypeptides of defined chemical structure was limited 
to products of ~50 amino acid residues (19; Figure 2). This size limitation ap- 
plied equally to synthesis by solution or by solid-phase methods, but for differing 
reasons. 
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Figure 1 Crystal structures of chemically synthesized HIV-1 protease. These were the 
original high-resolution structures (23, 25-27) of this protein and guided the subsequent 
drug design programs. The synthetic protein preparation used for X-ray crystallography 
contained L-a-amino-M-butyric acid residues in place of the two Cys residues in each subunit. 
(Left) Molscript representation of the synthetic enzyme in complex with the substrate- 
derived inhibitor MVT101 (25). (Right upper panel) 2Fo-Fc electron density map for the 
side chains of the unnatural amino acids used to replace the two Cys residues in each subunit 
of the synthetic enzyme (23). (Lower panel) Side chains of the L-a-amino-w-butyric acid 
residues superimposed on the mercury atoms from Cys- containing enzyme (24) that has 
been crystallized in the same space group. This shows that the side chains of the unnatural 
amino acid have the same conformation as the natural Cys side chains. (Adapted from 
References 23 and 25). 
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Figure 2 Historical progress in the size of synthetically accessible polypeptides. 
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Classical solution synthetic chemistry involves the preparation of fully protected 
peptide segments and their subsequent condensation in organic solvents for the 
convergent synthesis of large polypeptides (36). The problems associated with 
this classical approach have been summarized (19). These limitations include 
the laborious and technically demanding preparation of the protected segments, 
the lack of general, highly resolving methods for the purification of protected 
segments, and the inability to directly characterize fully protected peptides — even 
by modern analytical methods. 1 In addition, it became apparent that fully protected 
polypeptide chains frequently had only limited solubility in organic solvents that 
are useful for peptide synthesis. This poor solubility made such protected peptide 
segments difficult to work with, and the low concentrations attainable for reacting 
| segments often led to slow and incomplete reactions (37, 38). 

| g By contrast, unprotected peptide segments usually have good solubility proper- 

I § ties, are more easily handled, and can be directly characterized. The most efficient 

| way of making unprotected peptides is stepwise solid-phase peptide synthesis 

| £ (SPPS). This ingenious chemical synthesis method, the progenitor of all polymer- 

ia supported organic chemistry, was introduced in 1963 by Merrifield (13). Both the 

principles and the practical aspects of SPPS have been thoroughly described (19). 
| By the end of the 1980s, it was possible by highly optimized stepwise SPPS (19) 

to make, in good yield and high purity, essentially any peptide <50 amino acids in 
length. Reverse -phase high-pressure liquid chromatography methods could be rou- 
| M tinely used to purify these synthetic products and to evaluate their homogeneity 

Q | (39). More recently, electrospray mass spectrometry has provided a straightfor- 

£ g ward general method for the precise characterization of the covalent structure of 

g * unprotected synthetic peptides (40) . Despite the extraordinary power of solid-phase 

$ i peptide synthesis, lack of quantitative reaction eventually leads to the formation 

| w of significant levels of resin-bound byproducts. It is this statistical accumulation 

^ £ of coproducts that limits the ultimate size of high-purity polypeptides of defined 

ja covalent structure that can be effectively prepared in this way. 

| £ Thus, synthetic peptide chemistry, whether by stepwise SPPS (19) or by solution 

> gj methods (36), can provide routine access to polypeptide chains of ~50 amino acids. 

& J This corresponds to only the very smallest proteins and protein domains. 

| =^ A number of attempts were made to take advantage of the ability to make, 

"° characterize, and handle unprotected peptides (41-44). Noteworthy is the devel- 

opment of enzymatic ligation methods for the preparation of large polypeptides 
from synthetic peptide segments, with ligase enzymes specifically engineered for 
this purpose by the methods of molecular biology (45). Ironically, the princi- 
pal obstacle to general utility of enzymatic ligation has proven to be the limited 
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^or example, electrospray mass spectrometry has become one of the most useful tools for 
determining the covalent structure of peptides (40); this powerful method involves direct 
ionization of an analyte from aqueous solution. The efficacy of this ionizaton depends on the 
presence of multiple ionizable groups in the molecule under study. The lack of such groups 
in fully protected peptides precludes direct analysis by electrospray mass spectrometry. 
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solubility of even the unprotected peptide segments, under the physiological con- 
ditions compatible with the enzymes used (46). Despite considerable efforts and 
some notable successes (47), such methods have not found widespread use. 



CHEMICAL LIGATION OF UNPROTECTED 
PEPTIDE SEGMENTS 



As recently as 1991 (48), the challenge remained: namely, to develop methods that 
enable the general application of the tools of chemistry to the world of the protein 
§*ij molecule. It was evident (41-44, 48) that a truly useful approach to chemical 

| Z protein synthesis would be based on the ability to routinely make unprotected 

> * peptides <50 amino acid residues in length and would consist of a practical way to 

§ § stitch such synthetic peptides together to give polypeptides of any desired length, 

I & and hence the corresponding folded protein molecules. 

•3 £ Based on this premise, in the early 1990s the principle of chemoselective re- 

I g action (49) was adapted to enable the use of unprotected peptide segments in 

chemical protein synthesis (50). This novel "chemical ligation" approach relied 
|o on a conceptual breakthrough, the principles of which are shown in Figure 3. 




analogue structure 



Figure 3 Principles of chemical ligation (48, 50). Uniquely reactive functionalities are in- 
corporated into each peptide by chemical synthesis. Mutual chemoselective reaction of these 
moieties allows the use of completely unprotected peptide segments, which are prepared by 
standard means and can be readily purified and characterized by sensitive, high- resolution 
methods. Reaction is carried out in aqueous solution in the presence of chaotropes, such as 
6 M guanidine-HCl, to increase the concentration of reacting segments and speed up the 
reaction. The product polypeptide is obtained directly in final form. 
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TABLE 1 Chemistries used for the synthesis of native proteins by 
chemical ligation of unprotected peptide segments 

Chemistry Reference 



3 £; 



: -.3 



3 d The price paid for such unprecedented synthetic convenience, at least in the. 

£ | initial stages of development of the method, was the formation of an unnatural 

§ J[ structure at the site of ligation between two peptide segments (50). However, 

S ^ these unnatural structures are often well-tolerated within the context of a folded 

| ^ protein, and numerous examples exist of fully active protein molecules that are 

*§ J chemically synthesized in this way. Some early examples of proteins prepared by 

S j the chemical ligation method include (a) enzy matically active HIV- 1 protease (50) ; 

» (b) the mirror image enzyme D-HIV- 1 protease, which was prepared by a thioester- 

g s2 forming chemical ligation (52; Figure 4) and its high-resolution crystal structure 

I £ determined (20); (c) the facile total synthesis of proteinlike TASP molecules of 

unusual topology (53-55); (d) the synthesis of backbone-engineered variants of 
the HIV- 1 protease (56) to investigate the mechanism of the enzyme (Figure 5); (e) 



1. Thioester-forming ligation 


50 


2. Oxime-forming ligation 


53 


3. Thioether- forming ligation 


59 


4. Directed disulfide formation 


85 


5. Thiazolidine-forming ligation 


60,61 


6. Peptide bond-forming ligation 


62 



to O 

% z 

■g 3 

g 1 .In essence, the use of unique, mutually reactive functional groups not normally 
§ | found in peptides enabled the site-specific ligation of completely unprotected pep- 

4 o tide segments for the synthesis of large polypeptide chains. Reactions were de- 

signed to be carried out in aqueous solution, and a chaotropic agent such as 6 M 
guanidine-HCl was used to increase the solubility of the reacting peptide segments, 
| § thereby allowing the use of higher peptide concentrations to accelerate the ligation 

^ § reactions. 

1 1 This chemical ligation method has proven to be simple to implement, highly 

- * effective, and generally applicable (5 1 ) . A variety of ligation chemistries has been 

used (Table 1), and the chemical ligation of unprotected peptide segments has 
8 | provided access to a range of protein targets. 



Figure 4 Total synthesis of mirror image forms of the HIV-1 protease enzyme molecule 
(52). (Left) Unprotected ^-50-residue peptide segments are reacted by thioester-forming 
chemical ligation to give the 99-residue polypeptide chain of the HIV-1 protease monomer. 
Folding gave excellent yields of the homodimeric enzyme molecules. (Right) Reciprocal 
chiral specificity of the mirrofimage enzyme molecules, exemplified in afluorogenic assay. 
The-ligated L-enzyme acted only on the L- substrate, whereas the li gated D-enzyme acted 
only on the D-substrate. 
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Figure 5 Backbone-engineered HIV-1 protease by chemical ligation (56). (Top) Design 
of the variant enzyme. (Top left) H bonding of "water 301" by the amide — NH— of Ile50 
at the tip of each flap structure. (Top right) Sulfur atoms replacing these — NH— moieties, 
thus deleting the H-bonding potential. (Bottom) Synthetic scheme. Nucleophilic thioester- 
forming ligation, with inversion of configuration at the 'D-Ile50 5 chiral center, to give the 
desired 99- residue polypeptide, which is folded to form the homodimeric enzyme molecule. 



the synthesis of fully functional covalent heterodimers of b/HLH/Z transcription 
factors (57; Figure 6); and (J) the synthesis of receptor mimetics (58). 

These and other syntheses performed by the chemical ligation method demon- 
strated that proteins could now be made in high yield and good purity from unpro- 
tected peptide building blocks and that unnatural analogs could be readily prepared 
to investigate new aspects of protein structure and functioa 
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Figure 6 Total synthesis of a covalent heterodimeric transcription factor, cMyc-Max, 
by convergent chemical ligation (57). {Upper left) Molecular model of the covalent con- 
struct, bound to cognate duplex DNA. {Upper right) Synthetic scheme — each B/HLH/Z 
domain was assembled by thioester-forming chemi cat ligation of two peptide segments; 
these polypeptide products were then covalently linked by oxime-forming chemical liga- 
tion to yield a synthetic protein construct with two N terminals and no C terminal. {Lower 
right) Circular dichorism measurements showed that the covalent cMyc-Max construct 
folded correctly and was preordered even in the absence of cognate DNA. {Lower left) The 
covalent cMyc-Max heterodimer was active in a gel shift assay for DNA binding. Adapted 
from Reference 57 and Ferre-D'Amare AR (1995. PhD thesis). 



NATIVE CHEMICAL LIGATION 

The original ligation chemistries (50, 53, 59-61) gave a nonpeptide bond at the site 
of ligation. In 1994, based on the original principles of the chemical ligation method 
(48, 50), Dawson et al introduced an ingenious extension of the chemistries used 
for the chemoselective reaction of unprotected peptide segments — native chemical 
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Figure 7 Native chemical ligation (62). Unprotected peptide segments are reacted by 
means of reversible thiol/thioester exchange to give thioester-linked initial reaction products. 
Uniquely, the thioester-linked intermediate involving anN-terminal Cys residue (boxed) is 
able to undergo nucleophilic rearrangement by a highly favored intramolecular mechanism; 
this step is irreversible (under the conditions used) and gives a polypeptide product that is 
linked by a native amide (i.e. peptide) bond. Only a single reaction product is obtained, 
even in the presence of additional Cys residues in either segment. The product polypeptide 
is subsequently folded to give the desired synthetic protein molecule. 



ligation (62). In this method, simply mixing together two peptide segments that 
contain correctly designed, mutually reactive functionalities led to the formation 
of a single polypeptide product containing a native peptide bond at the ligation site. 
This highly chemoselective reaction is performed in aqueous solution at neutral 
pH under denaturing conditions. The chemical principles underlying the native 
chemical ligation method are shown in Figure 7. 

The essential feature of native chemical ligation is the (transient) formation 
of a thioester-linked product, as was the case in the original method (50) for the 
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synthesis of proteins by chemical ligation. In the native chemical ligation method, 
however, this initial thioester-linked product is not isolated; rather, it is expressly 
designed to undergo spontaneous rearrangement, via intramolecular nucleophilic 
attack, to give the desired amide-linked product (62, 63). The result is a completely 
native polypeptide chain that is obtained directly in final form. 

A feature of the native chemical ligation method is that ligation occurs at a 
unique N-terminal Cys residue. It does not matter how many additional internal 
Cys residues are present in either segment (62, 64). No protecting groups are 
necessary for any of the side-chain functional groups normally found in proteins, 
and quantitative yields of the ligation product are obtained, 
■g Where this exquisite selectivity originates is important; it lies in the use of rever- 

sible thiol/thioester exchange reactions to form the thioester-linked intermediate 
| *g ligation products (62, 63). The exchange is promoted by suitable thiol catalysts 

I g and is freely reversible under the neutral aqueous conditions used for the reaction. 

| £ Intramolecular nucleophilic attack to form the amide bond at the ligation site is 

| £ irreversible under the same conditions, so that, over the time course of the reaction, 

|| all of the freely equilibrating intermediates are depleted by the irreversible reac- 

tion step, giving a single polypeptide ligation product. Typical data from a native 
chemical ligation reaction are shown in Figure 8. Detailed studies of mechanistic 
aspects of the native chemical ligation reaction have been published (63, 65). 
Formation of a native peptide bond at the ligation site has been unequivocally 
| a demonstrated by a variety of methods, including chemistry (62), NMR (66), and 

^ | X-ray crystallography (67; Figure 9). A remarkable feature of the native chemical 

g ^ coupling reaction. Detailed studies have been carried out, and no racemization 

NOV 
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O <L> 

CM - 



w O 



<U z 



ligation of unprotected peptide segments is the absence of racemization in the 
coupling reaction. Detailed studies have been carried out, and no racemization 
j .jf was detected in the ligation product to a limit of < 1% D-amino acid content (68). 

BIOCHEMICAL PEPTIDE LIGATION 



Protein Splicing 



I s 

3 £ This cellular processing event occurs post-translationally at the polypeptide level 

| V> in certain classes of protein molecules, to generate a truncated final product that 

results from excision of the central portion of the initial polypeptide produced 
on the ribosome. Intein-mediated protein splicing is a biochemical 2 reaction that 



2 It can be expected that examples of chemical ligation of protein domains will be discovered 
in vivo, making use of biochemical mechanisms other than intein-mediated protein splicing. 
In a variety of phyla, the cell already makes use of polypeptide thioesters in numerous 
biochemical processes. These processes include the action of cysteine proteinases (69); the 
ubiquitination of proteins targeted forcatabolic destruction (70); the nonribosomal synthesis 
of peptides (71 ); and in the complement-mediated response to foreign pathogens (72). Given 
the obvious utility to the cell of cutting and pasting protein domains at the polypeptide level, 
it is reasonable to assume that nature will have worked out ways of taking advantage of its 
existing et tool kit" to accomplish this task by chemical ligation. 
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Figure 8 Raw analytical HPLC data from the synthesis by native chemical ligation of the 
110-residue polypeptide chain of the enzyme barnase (63). The two unprotected peptide 
segments, (l-48) a COSbenzyl and Cys49-1 10, were reacted in aqueous solution at pH 7 in 
the presence of a thiol catalyst. Data after 7 h show a nearly quantitative reaction to form 
a single product. (Insert) Electrospray mass spectrometric data for the ligated 1 10- residue 
polypeptide, M w 12,343. 

has considerable utility in, its own right, and it is the subject of another chapter in 
this same volume (73), It is interesting that the publication of the native chemical 
ligation method (62) in 1994 strongly influenced the subsequent elucidation of crit- 
ical aspects of the biochemical mechanism of natural protein splicing, as described 
by Xu et al (74). Based in part on an appreciation of the thioester-mediated acyl 
shift mechanism that had already been defined for the synthetic native chemical 
ligation reaction (62), protein splicing was shown to proceed via (intein-mediated) 
formation of (thio)ester-linked intermediates, followed by nucleophilic attack to 
form the final amide-linked spliced polypeptide (73). 

In fact, the mechanisms of native chemical ligation and intein-mediated protein 
splicing are quite distinct in certain critical respects, despite the shared features 



Figure 9 Crystal structure of the synthetic protein Eglin C. (Left) Tserine protease inhibitor 
Eglin C (orange) complexed with recombinant subtilisin (green) (67). (Right) Structure of 
the bond formed at the site of native chemical ligation in the synthesis of the Eglin C protein. 
The newly formed amide bond is defined by continuous electron density in the 2F Q -F C map. 



Adapted from Reference 67. 
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of involvement of (thio)ester-linked intermediates and the final step of an S- or 
O- to N-aeyl shift to form the amide-linked product Both the mechanistic basis 
of selectivity and the thermodynamic driving force for the ligation reaction dif- 
fer between the two processes. In intein-mediated splicing, the precise site of 
joining the N-extein and C-extein peptides is biochemical in origin and arises 
from spatial juxtaposition of the reacting residues, which is brought about by the 
folded conformation of the intein protein domaia By contrast, in native chemical 
ligation, initial reaction products may involve every thiol functionality in the react- 
ing peptide segments (64); the exquisite selectivity originates in freely reversible 
thiol/thioester exchange among these initial products, followed by irreversible re- 
arrangement of just one intermediate to give a single, defined reaction product 
(62). 

The thermodynamic driving force also has distinct origins in the two processes. 
The starting point for intein-mediated splicing is amide bonds within a single 
polypeptide chain, and the reaction yields a ligated product and a peptide frag- 
ment (73). Thus, the driving force for the biochemical splicing reaction is actually 
the same as for solvolytic cleavage of peptide bonds — the generation from an 
uncharged amide of an ionized moiety (perhaps two) with favorable solvation 
properties. There may also be a contribution from the distorted (high energy) state 
of the starting amide bond induced by the folded structure of the intein-containing 
protein (73). For native chemical ligation, the reaction of apeptide-thioester with an 
amine to form an amide (i.e. peptide) bond is strongly favored on simple enthalpic 
grounds. Because of these critical mechanistic differences in thermodynamic driv- 
ing force and selectivity of reaction, it is an oversimplification to describe native 
chemical ligation as "biomimetic" (75). 

Use of defective-intern expression systems as a route to the preparation of 
peptide -thioesters for use in native chemical ligation is discussed on p. 951. 

Conformationally Assisted Ligation 

In some cases, folding conditions can be used to accelerate the rate of native 
chemical ligation (76). Many proteins can be cut into two or more polypeptides 
that can be reconstituted to form a nativelike conformation. In these cases, the 
revealed N and C terminals of the peptide fragments are located in close proximity 
at the site of chain scission. This greatly increases the collision frequency, and a 
weakly activated C-terminal group such as a thioester can be used to religate the 
fragments. Total synthesis of proteins using this approach has been demonstrated 
with the chymotrypsin inhibitor CI2 (Figure 10). When two synthetic peptide seg- 
ments spanning the CI2 molecule, one incorporating a C-terminal thioester and 
the other an N-terminal cysteine, are mixed together under folding conditions, 
conformationally-assisted ligation proceeds in <2 min, compared with several 
hours for chemical ligation under denaturing conditions (76). In suitable sys- 
tems, the peptide- a thioester segment can even be mixed under folding conditions 
with a version of the other peptide segment without a Cys at the N terminal, and 
conformationally-assisted ligation still proceeds in a matter of hours. 
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Figure 10 Conformational ly assisted chemical ligation, exemplified for the chymotrypsin 
inhibitor C12. (4 left) Thioester-mediated chemical ligation at Cys under standard dena- 
turing conditions occurs over several hours. (B, right) The same ligation reaction under 
folding conditions, in which the two segments associate to increase the collision frequency 
between the reacting functionalities, is complete within 3 min (76). 

This conformationally assisted chemical ligation extends previously devel- 
oped semisynthetic approaches that used other weakly activated ester groups for 
C-terminal activation of a peptide segment (77). 



SCOPE OF NATIVE CHEMICAL LIGATION 
FOR THE SYNTHESIS OF PROTEINS 

The broad scope of the native chemical ligation method for the total synthesis of 
proteins is summarized in Figure 1 1 and by the data shown in Table 2. 

The first applications of native chemical ligation were to small, Cys-rich pro- 
teins such as disulfide-cross-linked secretory proteins or the zinc-finger proteins. 
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Figure 11 Applications of native chemical ligation. 



In all, >300 biologically active proteins from >20 different families have been 
successfully prepared by total chemical synthesis with this method. These are still 
early results in what will surely be more widespread application of the method, but 
they demonstrate routine synthetic access to single-domain proteins and suggest 
that native chemical ligation will provide the basis of a general synthetic access to 
the world of proteins. 



FOLDING SYNTHETIC PROTEINS 

The activity of a protein molecule originates in the precise tertiary structure of 
the folded polypeptide chain. To complete the total synthesis of a functional pro- 
tein molecule, the synthetic polypeptide chain that corresponds to the sequence 
of the protein must be folded to form the correct three-dimensional structure. Our 
intriguing experience to date has been that chemically synthesized polypeptide 
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TABLE 2 Selected proteins prepared by total synthesis using native chemical ligation 8 

Protein class/families b Protein molecular mass (kDa) Polypeptide size (aa) 
Secretory 

Chemokines 8-10 -70 

Cytokines 15-20 —160 

BMPs -25 2 x 115 

| I Ser PR inhibitors 6-8 58-70 

Agouti proteins 6-12 50-112 

3 g AFP -6 -50 



8. 



Anaphlyatoxins —8 —70 



*c3 

Is EGFs/TGF-o! -6 -50 



SH2 domains -10 -90+ 



5 £ 

oo 

5^ Receptor/membrane 

|i & 2 microglobulin 12 99 

c 

■g.° glplrN-term domain 14 120 

-o « 

§ 5 Influenza m2 50 4 x 97 

*i« 

1 5 Intracellular 

. g 

o a 

<8 <g SH3 domains —7 —60 

b/HLH/Z 16-20 2 x 70-1: 

2*5 Zn-finger -8 -70 

8 | Redox 

E ^ Desulforedoxin 8 2 x 36 

1| Rubredoxin 6 53 

CO J 

> 



Cyt b5 10 82 
^ J Enzymes 

1^ Retroviral proteases 20 2 x 99-116 

Secretory PLA2s 14 -120 

MIF 39 3 x 115 

Barnase 12 110 

"Research scale synthesis typically gives 50-1 00 mg of each protein; each of the above proteins had the expected biochemical 
or biological activity; three-dimensional molecular structures were determined by NMR or X-ray crystallography for many 
of these proteins. 

b BMP, bone morphogenetic protein; AFP, anti-fungal protein; EGF, epidermal growth factor; TGF-a, transforming growth 
factor-a; MTF, macrophage migration inhibitory factor. 
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TABLE 3 Some structural motifs successfully folded 
as synthetic proteins 3 


Chemokine fold 


SH3 


Ser protease inhibitor fold 


PLA2 (14-kDa form) 


Kringle fold 


Cytokine fold 


Agouti Cys-rich domain 


TGF-0 fold 


EGF-fold 


b/HLH/Z DNA-binding domains 


SH2 


Rubredoxin 


2n-flngers 


40T/MIF 


Aspartyl protease fold 


Chi tin binding domains 


Anaphylatoxins 


Ion channels 



"EGF, Epidermal growth factor; TGF-^, transforming growth factor 40T/MIF, 
4-oxalocrotonate tantomerase/macrophage migration inhibitory factor. 



chains fold efficiently in vitro to give fully functional protein molecules (78). Such 
synthetic proteins have unique, defined folds of the polypeptide chain, as shown 
by NMR measurements (79, 80) and by X-ray crystallography (67, 79, 81; Figures 
12 and 13). Some examples of structural. motifs successfully folded as synthetic 
proteins are given in Table 3. 

Thus, correct folding of synthetic proteins is efficient, accurate, and general at 
the level of single domains. 

A growing list of multidomain proteins have also been successfully produced 
by the folding of chemically synthesized polypeptide chains (e.g. see Figures 1, 
6, and 14). These proteins include homodimers (22,23), heterodimers (20,57), 



Figure 12 Crystal structure of AOP- 
RANTES (79). This chemically mod- 
ified chemokine protein was prepared 
by total synthesis, using native chemical 
ligation. X-ray diffraction was used to 
determine the structure to 1.6-A resolu- 
tion. (Top) Ribbon structure of the crys- 
talline dimer. (Bottom) 2F 0 -F C electron 
density map corresponding to the unnat- 
ural aminooxypentane oxime moiety. 




3 




AMIMOOXrFENTAN-E 
OXIME 
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Figure 13 Three-dimensional crystal structure of the mirror image protein molecule, D-HIV-1 
£ £ protease (20). The protein was prepared by thioester-forming chemical ligation of peptide segments 

synthesized with D-amino acids (52). (Left) Molscript representation of the ligated chemically 
g g synthesized D-protein molecule, displayed as the L-form for comparison purposes (20). (Right) 
(§ c Molscript representation of recombinant L-HIV-1 protease, prepared in E. coli (81a). The close 

« ^ similarity of the folded structures of the synthetic ligated D-protein and the recombinant L-protein 

oS is clearly evident. 

i| 

S J and hexamers (82), as well as proteins containing two (57) and even three (82a, 

8 | TM Hackeng, JA Fernandez, PE Dawson, SBH Kent, JH Griffin, submitted for 

g J publication) domains in a single polypeptide chaia The successful syntheses of 

g | such proteins suggests that this ability to accurately fold synthetic polypeptide 

| ^ chains may hold true both for single domains and for more complex proteins. 

| J Certainly, in vitro folding, in which the system contains only a single homoge- 

B 3 neous polypeptide of defined covalent structure, is utterly distinct from and much 

| |* simpler than the situation in vivo, in which the complex intracellular environment 

d & contains multiple interacting protein species at high local concentrations. In con- 

| £ sequence of this complexity, it has recently been found that the cell possesses 

a sophisticated "chaperone" apparatus that is involved in protein folding in vivo 

(83). 

In chemical protein synthesis, the folded protein molecule is formed only at 
the final stages of production, under carefully controlled laboratory conditions. 
Control of the folding process can be particularly important in the production of 
proteins that are toxic to the cell, such as proteolytic enzymes. Using chemistry, 
it is possible to keep the polypeptide unfolded and inactive until after ligation 
and purification, when folding can be carried out in the presence of an inhibitor. 
This control over enzymatic activity was one of the key features of the success of 
chemical protein synthesis in the early work on the HIV-1 protease (20). 
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Figure 14 Total chemical synthesis of a multidomain protein: a tethered-dimer form of the HIV- 1 
protease. (Left) Convergent ligation synthetic strategy (85). Thioester-forming ligation was used 
to make each of the 101-residue monomers, and then directed disulfide formation was used to 
make the 202-residue synthetic dimer. The — NH— moiety of Ile50, in one monomer only, was 
substituted for by an — O— by incorporation of an ester at that position in chemical synthesis of 
the peptide segment (20). (Right) Analytical data. (Top right). HPLC, showing high-purity product 
and autolysis fragments consistent with the full enzymatic activity of the backbone-engineered 
22-kDa protein. (Bottom right) Electrospray mass spectrometry data showing the high purity and 
correct observed mass of the synthetic construct. 



CASE STUDIES IN THE APPLICATION OF CHEMICAL 
PROTEIN SYNTHESIS 

Noncoded Amino Acids 

Using chemistry to make proteins, it is straightforward to introduce an almost 
unlimited range of "unnatural" amino acids at any specific site(s) in a protein 
molecule. A classic example is Low's investigation of the "second shell" effects 
on the redox potential of an iron-sulfur protein, by systematic substitution of non- 
coded amino acids (84). Demonstrating the power of the chemical protein synthesis 
method, large amounts of each protein analog were made, purified, and fully char- 
acterized. Another example is the incorporation of a thienyl-Ala in place of a 
His residue in the enzyme PLA2 to establish the critical function of an imidazole 
side-chain functionality in the action of that enzyme (65). With chemical synthe- 
sis, multiple substitutions can be readily made at any position of the polypeptide 
chain, enabling virtually unlimited combinations of number, type, and position of 
noncoded amino acids to be incorporated into a protein molecule (85). Such studies 
can be very informative as to the structural basis of protein function and are made 
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straightforward by the synthesis of native proteins by chemical ligation of unpro- 
tected synthetic peptide segments, yet they are extremely difficult or impossible 
by recombinant DNA methods. 

Noncoded amino acids are frequently found in native proteins in vivo. These 
arise from specific post-translational enzymatic modification of coded amino acid 
residues. One common modification of this type is y -carboxy -glutamic acid (Gla), 
found for example in the'eponymous Gla domains in plasma proteins. This modi- 
fied amino acid is not produced biosynthetically in bacteria or yeasts, which rules 
out simple expression, so chemical synthesis offers an attractive route to the prepa- 
M . ration of proteins containing Gla domains. 

§f An example is human plasma protein S, a 635-amino-acid (aa) plasma pro- 

| g tein that acts as an anticoagulant cofactor. This multidomain protein consists of 

J g an N-terminal Gla domain that contains 1 1 Gla residues, which is followed by 

| § a thrombin-sensitive region, three epidermal growth factor domains, and a sex 

| & hormone-binding globulinlike region. A polypeptide construct containing the first 

| & three domains, Gla (1-46 aa)-thrombin-sensitive region (47-76 aa>-epidermal 

8 5 growth factor- 1 (77-1 16 aa), has been synthesized from three segments, using na- 

1*5 tive chemical ligation (TM Hackeng, JA Fernandez, PE Dawson, SBH Kent, JH 

£ c Griffin, submitted for publication). Folding of this polypeptide chain produced 

Ij = w a three-domain protein, microProtein S, that displayed anticoagulant cofactor 

J 1 activity. 

9 1 Precise Covalent Modification 

5 1 The ability to prepare native proteins by total synthesis, using chemical ligation 

S g. of unprotected peptide segments, provides a convenient and general route to site- 

2 g specific modification of the protein molecule. The full range of synthetic peptide 
§ « and peptidomimetic chemistry (86) is at the command of the researcher who wants 
g . to make precise and controlled changes in a protein's covalent structure. Such 
% ?t changes are not limited by the genetic code or by the strictures of the ribosomal 
S 5 machinery. With chemical synthesis, virtually any conceivable covalent modifica- 
5 |> tion can be introduced at will anywhere in the protein molecule. An early example 

3 £ of the utility of this approach was the total chemical synthesis of (BTD) HIV-1 pro- 
| £ tease (87), a protein in which the Gly-Gly sequence found in a -turn in the native 

protein (20) was replaced by the sterically constrained bicyclic compound BTD, a 
rigid mimetic of type IT /*-turn geometry (88). The resulting enzyme showed full 
activity and a significantly enhanced thermostability (87). 

Site-Specific Tagged Proteins 

Chemical synthesis enables the specific labeling of a protein molecule at unique 
site(s). Such specific modification is less likely to perturb the structure or activity 
of the protein than uncontrolled reaction with labeling reagents that stochastically 
target all amino or other particular functional groups in the protein. Fluorescently 
tagged proteins are extremely useful tools for biology and drug discovery, and 
synthesis of native proteins by chemical ligation allows the facile incorporation of 
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fluorescent dye molecules at any desired position in a protein molecule. In a recent 
example, a fluorescent Tip analog was incorporated by chemical synthesis into the 
Ras-binding domain of the protein Raf (89). The binding properties of the native 
domain were maintained, and the unique fluorescent label permitted the study of 
extremely fast kinetics of protein-protein binding. 

With chemical protein synthesis, it is possible to tune the fluorescent properties 
of the labeled protein to the task at hand. For example, dye chelator-labeled proteins 
have been made for time-resolved fluorescence studies, in which it is possible to 
largely eliminate background emission by use of suitably "time-gated" detection 
w . (C Hunter, G Kochendoerfer, 89a). 

g g Finally, total chemical synthesis allows the ready introduction of affinity tags, 

« | such as biotin, at precise sites in the protein molecule, while preserving biological 

J « activity, again something that is straightforward with chemistry. 

1 8. 

go 



0\ Cl. 



Backbone Engineering 



Another intriguing example of site-specific modification of the protein molecule, 
^5 enabled by chemical ligation, is the covalent modification of the polypeptide back- 

| S bone itself. This type of modification is not readily achieved, if possible at all, by 

genetic-engineering means. For example, a functionally important peptide bond 
(i.e. backbone amide) in the HIV-1 protease molecule was site-specifically replaced 
1 1 by a thioester moiety in each monomer of the homodimeric protein molecule, to 

(5 -g investigate the direct involvement of that specific peptide bond in the mechanism 

of action of the enzyme, as suggested by the X-ray ciystallographic data (20, 56). 
This approach was extended to the construction by total chemical synthesis of a 
. 22-kDa covalent tethered dimer of the HIV-1 protease (20, 85), in which only one 

8 J monomer was site specifically backbone engineered (Figure 14). The results of 

8 £ these studies showed that the two flap regions of the homodimeric native HIV-1 

§ £ protease molecule work analogously to the single flap moiety in the two-domain, 

| M single-polypeptide chain, cell-encoded aspartyl proteinases (90). 

® o A similar backbone engineering approach, in which specific amide — NH— moi- 

& g eties were replaced by — O— atoms, has been used to investigate the contribution of 

§ r individual backbone H bonds to protein-protein interactions (91). More recently, 

an engineered backbone structure was introduced into bovine pancreatic trypsin 
inhibitor, by replacing one Cys residue involved in forming a disulfide bond with 
an Af(ethylmercaptan)Gly, to investigate the effects of such a substitution on the 
folding, activity, structure, and stability of the resulting protein molecule (92). In 
this novel protein analog, the side chain of the Cys residue has effectively been 
moved to the backbone amide N atom. 



Protein Medicinal Chemistry 

Synthetic access enables the systematic application of the principles of medicinal 
chemistry to the protein molecule itself. An example is the total chemical synthesis 
of the potent anti-HIV molecule AOP-RANTES (79) (Figure 12). This chemical 
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protein analog was used as a lead compound in a successful program, based on 
chemical ligation, to develop even more potent anti-HIV molecules (J Wilken, 
D Thompson, H Gaertner, O Hartley, R Fish, JM McDonnell, Q Xu, D Fushman, 
D Cowburn, N Heveker, J Picard, SBH Kent, R Offoid, manuscript in preparation). 
The chemical protein analog NNY-RANTES, which resulted from the first phase 
of this program, is > 30-fold more effective as an anti-HIV compound and has been 
shown to prevent HIV infection at low nanomolar concentrations in the huPBL- 
SCID mouse model for acuired immune deficiency syndrome (93). NNY-RANTES 
is the most potent known anti-HIV compound. It is believed to work by inhibiting 
receptor recycling (94), thus clearing CCR5 from the surface of peripheral blood 
cells, a mechanism distinct from current clinical therapies for acquired immune 
deficiency syndrome. 

Rapid Access to Functional Gene Products 

In the past few years, an important new application lias emerged for chemical pro- 
tein synthesis — to enable rapid access to functional wild-type protein molecules 
directly from gene sequence data (Figure 15). Success of the genome projects has 
resulted in the discovery of > 100,000 new proteins (1). However, these newly 
discovered molecules are known only as predicted open reading frames in genome 
sequence databases— the biomedical researcher rarely has access even to the cDNA 
clone corresponding to a particular gene, let alone the protein itself. For example, 
the recent elucidation of the complete DNA sequence of the genome of Caenorhab- 
ditis elegans resulted in the identification of 19,090 open reading frames encoding 
^7.5 million amino acid residues of polypeptide sequence (95)! The probable 
roles of many of these predicted proteins can be tentatively assigned by analogy to 
proteins of known function, using bioinformatics. Nevertheless, the precise bio- 
chemical properties of a mature gene product can only be assessed at the level of 
the protein molecule itself. 

Synthesis of native proteins by chemical ligation of unprotected peptides can 
provide access in a matter of days to large (10 + mg) amounts of functional protein 
molecules of exquisite homogeneity, based directly on gene sequence data. Secre- 
tory proteins, which are generally small and rich in Cys residues, are particularly 
suited to facile preparation by native chemical ligation. As described above, over 
the past 3 years, >300 proteins and protein analogs have been prepared by this 
method (78). These synthetic proteins have been used in a wide range of biomed- 
ical research investigations, resulting not only in the definition of gene function 
but frequently in the elucidation of novel biology (96). 

Structural Biology 

Facile access to the large (i.e. multiple tens of milligram) amounts of high-purity 
preparations produced by chemical protein synthesis can be of great utility for 
studies of protein structure by NMR spectroscopy and by X-ray crystallography. 
New methods for NMR spectroscopy have considerably enhanced the speed with 
which the structure of small (i.e. <200-aa-residue) proteins can be determined. 
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Figure 15 From gene sequence direct to functional protein, using chemical protein 
synthesis. 



Obtaining sufficient (i.e. > 10-mg) amounts of correctly folded proteins is now 
often the limiting step in structure determination. In a number of instances, total 
synthesis by chemical ligation methods has provided rapid access to high-purity 
protein samples in amounts useful for NMR studies (97, 98). 

A recent case study of the determination by NMR of the novel structure 
of a chemically synthesized protein is the C-terminal Cys-rich domain of the 
"agouti-related" protein (80), a natural antagonist of the melanocortin-4 receptor 
involved in the control of human feeding behavior In addition to small protein do- 
mains, chemical ligation approaches have contributed to the analysis of large pro- 
teins, using NMR techniques. Muir and coworkers have made use of recombinant 
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expression of N-terminal cysteine and thioester polypeptides (folded as domains) 
to label individual domains within multidomain proteins (66). By reducing spec- 
tral complexity, this approach promises to greatly simplify the NMR analysis of 
proteins that are >200 amino acids in size. 

Chemistry also enables the precise site-specific introduction of NMR probe 
nuclei into the protein molecule. Thus, for the HIV-1 protease, the single y-C 
atom of the active-site Asp side-chain carboxylate in each protein subunit was 
uniquely l3 C-labeled (99). NMR measurements in the presence and absence of 
inhibitor showed distinctive chemical shifts as a function of pH. It was possible 
to define the protonation state of the enzyme's catalytic apparatus and, from the 
unusual and dramatic chemical shifts observed, to deduce the molecular basis of . 
the enhanced nucleophilicity of one of the two Asp side chains at the active site. 
1 1 It is this "super nucleophilicity" that is the defining feature of aspartyl proteinases 

| | as a class (100). This ability to precisely define at the level of a single functional 

& group the unique molecular basis of enzymatic properties demonstrates the power 

of chemistry applied directly to the protein molecule itself, 
oo Similarly, new X-ray crystallography methods have accelerated the pace of pro- 

g 5 tein structure determination. In increasing instances, protein synthesis by chemical 

«g a ligation has been used in conjunction with X-ray crystallography to determine the 

| ? structures of novel proteins. Examples include, the chemokine SDF-la (81), the 

J .1 chemical protein analog AOP-RANTES (79; Figure 12), and the mirror-image 

| & enzyme molecule D-HIV-1 protease (20; Figure 13). 

°. | Another important application of chemical protein synthesis is in the emerg- 

| § ing genomic structural biology programs, which are aimed at the determination 

S * of the three-dimensional molecular structures of representative examples from all 

2 .g- classes of proteins encoded in a particular genome (101). Such Wgh-throughput 

§ structure determination will require access to great numbers of proteins in high 

^ f2 purity and large amount. In addition, incorporation into the protein molecule of 

js £ seleno-methionine residues is essential to also provide direct phase information 

| a from anomalous X-ray scattering on the same protein sample. Chemical protein 

> » synthesis by the methods described here is well suited to provide the proteins 

^ J needed for genomic structural biology. A pilot study has been successfully com- 

pleted in which the viral chemokine vMIP-II was prepared in [Se]Met-containing 
form and used for structure determination by both ! H-NMR (98) and X-ray crys- 
tallography (E Lolis, submitted for publication). 
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CURRENT DEVELOPMENTS 

Expressed Protein Ligation 

From its inception, the native chemical ligation method was also envisioned for use 
with peptides that are produced by recombinant means (62). There are now mul- 
tiple examples of the chemical ligation of recombinant peptides. These alternate 
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sources for suitably functionalized peptides have extended the applicability of 
the native chemical ligation method to include the world of peptides and do- 
mains that can be successfully produced by recombinant-DNA-based expression 
methods. 

N-terminal cysteine recombinant peptides can be generated either by prote- 
olytic cleavage next to a cysteine residue (102) or by an intein-based approach 
(103). These recombinant products can be reacted with synthetic peptide-thioesters 
to generate native polypeptides of hybrid biological and chemical origin. More 
recently, intein-based protein expression vectors have been adapted to generate 
polypeptide thioesters by recombinant means for use in native chemical ligation 
(104, 105). Interception of the partly rearranged splicing intermediate by a suit- 
able thiol generates a recombinant peptide-thioester (Figure 16). These peptide- 
thioester segments can be reacted by native chemical ligation with a synthetic 
N-terminal Cys peptide to generate native polypeptides of hybrid chemical and bi- 
ological origin (104-107). With the approaches described above, both the peptide- 
thioester and the N-terminal Cys peptide canbe of recombinant origia This permits 
the use of native chemical ligation for the mixing and matching of recombinant 
polypeptide segments in vitro (66). 

Use of recombinant methods to generate the necessary peptide-thioester seg- 
ments thus permits even molecular biologists who are not skilled in chemistry 
to use the native chemical ligation technique (106, 107). This "expressed protein 
ligation" 3 can be expected to lead to widespread use of the native chemical ligation 
method in biological research laboratories (109, 110). 

Solid-Phase Protein Synthesis 

The principles of polymer-supported organic synthesis (13, 19, 1 1 1) have been ap- 
plied to the chemical ligation of unprotected peptide segments in aqueous solution 
[(1 12); Figure 17]. In solid-phase chemical ligation, unprotected peptide segments 
of 35-50 amino acids (i.e. ~5 kDa each) are used as building blocks to assemble the 
target polymer-bound polypeptide by consecutive ligation on a water-compatible 
polymer support. Strategies for segment condensation in both the N-to-C and C-to- 
N directions have been used successfully for solid-phase protein synthesis (112) 
and alternative linker chemistries developed (1 12a). 

Target molecules have been constructed from as many as eight peptide segments 
by solid-phase chemical ligation [e.g. the polypeptide of the tissue plasminagen 
activator catalytic domain; M w 25,000 (W Lu, unpublished data), and the polypep- 
tide chain of the enzyme secretory PLA2 GV has beenassembled in a single day 



3 Sometimes erroneously referred to as "intein-mediated ligation" (106). It is important to 
note that, where incipient splicing of a defective intein is simply used as a way of generating a 
(recombinant) peptide-thioester, the ligation itself is not intein mediated; rather, the ligation 
reaction is standard native chemical ligation of two unprotected peptide segments (62). For 
an example of true intein-mediated ligation, see Reference 108. 
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from four peptide segments (112). It can be anticipated that solid-phase chemical 
ligation will provide a practical chemical route to proteins that contain several 
hundred amino acids (Figure 18). 

Membrane Proteins 

An important aspect of the study of proteins which have been predicted from gene 
sequence data is the integral membrane class of proteins. Computer-aided analy- 
sis of the predicted open reading frames from a number of completely sequenced 
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Figure 17 Solid-phase chemical ligation (112). Native chemical ligation of unprotected 
peptide segments and the principles of polymer- supported synthetic organic chemistry (13, 
19, 1 1 1) are applied to solid-phase protein synthesis. In the example shown, the C- terminal 
segment of the target polypeptide is attached by a cleavable linker to a water-compatible 
support. The next segment as a peptide- a thioester is reacted by native chemical ligation, 
to give the polymer-bound ligation product. After removal of the Cys-protecting group 
(PG), successive rounds of ligation can be carried out to give the polymer-bound target 
polypeptide. After cleavage from the polymer support, the product is purified and folded to 
give the target protein molecule. 



Evidence Appendix 



SN 09/710,633 



-56 - 



CHEMICAL PROTEIN SYNTHESIS 953 



ll 

1 8. 

2 [X, 

c . 

U r 

•ii 

Q S3 

<N ^ 

OS 




S 

o 
u 



1 

3no sun 
SIZE p artno a^ldisj 

Figure 18 Size of synthetic polypeptides accessible by chemical ligation. 

genomes has suggested that 20%-30% of all proteins contain membrane-spanning 
polypeptide sequences in the mature form of the molecule (113). Such integral 
membrane proteins mediate many processes in the cell, including signal transduc- 
tion, ion transport, and active transport of macromolecules to name a few significant 
biological activities, and are thus important objectives for biomedical research. Yet 
integral membrane proteins are difficult to express at high levels by recombinant- 
DNA-based methods and have proven hard to isolate in homogeneous form in 
chemically defined media (1 14). 

It is interesting that Kochendoerfer et al (115) have shown that integral mem- 
brane proteins can be synthesized in large amounts by the chemical ligation of 
unprotected peptide segments and isolated in high purity in media of defined 
chemical compositioa An example is the total synthesis of the 11-kDa proton 
channel M2 protein of influenza A virus, which forms a tetrameric ion channel 
(115; Figure 19). The M2 protein had previously proven refractory to multiple at- 
tempts at expression by recombinant-DNA-based methods (W Degrado, personal 
communication), but was readily obtained by chemical ligation of unprotected 
synthetic peptides. 



Glycoprotein Synthesis 

Recently, the Bertozzi laboratory (1 16) reported the first total synthesis of a glyco- 
protein, using native chemical ligation in conjunction with innovative methods for 
the synthesis of glycopeptide- a thioesters. One of the most important applications 
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Figure 19 Chemical synthesis of an integral membrane protein (115). The 97-residue 
polypeptide chain of the influenza M2 protein was prepared by native chemical ligation and 
folded to form the active tetrameric form. {Insert) Electrospray mass spectrometric data 
showing the desired product, mass 1 1,170 Da. 

of chemical protein synthesis will be the systematic preparation of glycoforms 
of gylcosylated proteins as homogeneous molecular species of defined covalent 
structure, to establish the role of the carbohydrate moiety in the biological function 
of the glycoproteia In the near future, we can expect to see an increasing number 
of examples of this important capability made possible by native chemical ligation 
(62) and by other chemoselective reactions (1 17). 

FUTURE DEVELOPMENTS 
Ligation Sites 

In its current form, native ligation chemistry uses a Cy s residue at the site of forma- 
tion of the new peptide bond joining two unprotected peptide segments. This means 
that, for a protein to be accessible by native chemical ligation, there must be no 
region in the polypeptide chain > 50-60 aa residues without at least 1 Cy s residue. 
Although the requirement for a Cy s at the ligation site may superficially be viewed 
as a stringent limitation of the method, it is actually less restrictive than it at first 
seems. There are hundreds of protein families with interesting biological activities, 
encompassing many thousands of protein molecules that have native Cys residues 
located in positions compatible with direct application of native chemical ligation 
(118). 
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In actual practice virtually any protein molecule can be made by native chemical 
ligation. For proteins with no suitable Cys ligation sites in the natural sequence, 
it is possible to simply put a Cys wherever one is needed for ligation, usually 
without deleterious effects on function (63, 67; see Figure 9). The work of Muir 
and coworkers is illustrative of this expedient but effective approach (66, 104, 110). 
Their chemical ligation of recombinantly expressed poly peptide-* thioesters to syn- 
thetic peptides has typically made use of an arbitrarily introduced Cys residue at 
the desired ligation site, with no deleterious effects. Also, biological researchers 
frequently insert Cys residues into a polypeptide chain to investigate the structure- 
function relationships in a protein molecule (1 19) or as a site for the introduction 
of a spectroscopic probe, such as an electron spin resonance label (120). This 
proven utility of arbitrarily introduced Cys residues provides considerable flexi- 
£ ■§ bility in synthetic design for the preparation of functional protein molecules by 

| § native chemical ligation at Cys. 

§ Additionally, it would be desirable to have' the option to use thioester-mediated 

| (2 chemical ligation at residues other than Cy s . A prototype procedure for the use of an 

.§ i auxiliary-functional-group approach to native amide-forming, thioester-mediated 

g 5 chemical ligation has been reported (121). This work defined the principles of an 

eg c effective approach to ligation at non-Cys residues, but the chemistry used had to 

| = 2 be refined and extended because severe shortcomings were observed in the orig- 

1 1 inal investigation, as revealed by studies in model systems (121). In this respect, 

| M recently reported work from the Dawson laboratory at The Scripps Research In- 

stitute may represent a more effective chemistry for ligation at residues other than 
Cys (121a), using the same auxihary-fimctional-group approach. 
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Size of Protein Targets 



gj P To date, it has proved possible to make every protein that has been attempted 

£p by the chemical ligation of unprotected peptide segments in aqueous solution, 

J -2 even integral membrane proteins. However, some targets are significantly more 

> & work than others — especially if there are multiple intermediate ligation products to 

* J handle. The recently developed solid-phase protein synthesis method (see above), 

| V, usin S polymer-supported chemical ligation (112), provides a very effective means 

"° for the ready isolation of these intermediate products and will significantly simplify 

syntheses requiring ligation of multiple segments. 

The work of our own and others' laboratories, including the laboratories of 
Ofiford (University of Geneva, Switzerland) and Muir (Rockefeller University, 
New York, NY), lias failed to show any inherent size limitations for application 
of the chemical ligation method, up to several-hundred kilodaltons in the latter 
case (122). Folding of chemically synthesized polypeptide chains to form native 
proteins, in which significant problems might have been anticipated, is usually 
straightforward for the domain size proteins made to date. Folding of complex 
multidomain proteins may or may not be as straightforward. In any event, unlike 
expression systems, chemical ligation allows the option of constructing complex 
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proteins by separately folding each domain and then stitching the folded domains 
together (123). 

Chemical Synthesis of Peptide Segments 

Virtually any target protein can be prepared by total chemical synthesis, provided 
that a suitable set of high-purity peptide-thioester segments is available. Ironically, 
for many researchers the most challenging aspect of applying the chemical lig- 
ation method to proteins is making the peptide segments. To date, the principal 
constraint on widespread application of the native ligation method has been the 
lack of methods for the facile chemical synthesis of unprotected peptide- a thioester 
segments. Fortunately, there is an abundance of expertise available for the chem- 
1 1 ical synthesis of peptides (86). The need to make large numbers of analogs of 

thousands of native proteins by chemical ligation, and hence to prepare many 
g, tens-of-thousands of peptide segments, provides an unprecedented impetus for the 

£ development of efficient methods of peptide synthesis. We can look forward with 

confidence to the development of radically improved methods for the rapid, cost- 
_ effective preparation of large numbers of unprotected peptide-thioester segments 

|S for use in chemical protein synthesis (124). 

S § 

u = 

11 

fj SUMMARY AND CONCLUSIONS 

§ g Total synthesis by the chemical ligation of unprotected peptide segments can now 

g * provide general access to native proteins of <30 kDa (Figure 18) in size. This 

8 .g* size range encompasses the structural and functional domains that are the modular 

§ building blocks of function in the protein world, from enzymes to receptors, from 

^ £ signal transduction adaptor molecules to large multisubunit protein assemblies. A 

J§ ^ wide range of different proteins has already been synthesized, leading to novel 

•J -2 biology, new three-dimensional structures, and new insights into the molecular 

> & basis of protein function. In addition, it has already been demonstrated that it is 

* J possible to stitch together, by chemical ligation folded protein domains of any size, 

5 ' promising general access to the world of proteins. 

Perhaps the most significant future application of chemistry to proteins will 
be in the creation, at will, of stable post-translational modified forms of protein 
molecules as homogeneous entities of precise covalent structure. This will enable 
the dissection at the level of the protein molecule of important biochemistry, such 
as the intracellular signal transduction pathways. It will also enable the systematic 
creation of new classes of protein therapeutics with enhanced properties. 

The stage is now set for the application of the tools of chemistry to the entire 
universe of proteins. Truly, as Edward 0. Wilson has remarked. "Where nucleic 
acids are the codes, proteins are the substance of life" (125). It is no exaggeration 
to say that understanding the molecular basis of protein action is one of the most 
important challenges of our era. The ability to apply chemistry to the study of 
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proteins, provided by the synthetic tools described in this article, will play an 
important part in addressing this challenge and will have a revolutionary impact 
on our understanding of gene function expressed through the medium of the protein 
molecule. 
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